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Sequence Comparison A 

RESULT 2 
ABC1 MOUSE 



ID ABCl_MOUSE STANDARD; PRT; 2261 AA. 

AC P41233; 

DT 01-FEB-1995 (Rel . 31, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE ATP-binding cassette, sub- family A, member 1 (ATP-binding cassette 

DE transporter 1) (ATP-binding cassette 1) (ABC-1) . 

GN ABCA1 OR ABC1 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=DBA/2 ; TISSUE^Macrophage ; 

RX MEDLINE=94375008; PubMed=8088782 ; 

RA Luciani M.F., Denizot F. , Savary S., Mattei M.-G., Chimini G.; 

RT "Cloning of two novel ABC transporters mapping on human chromosome 

RT 9 . " ; 

RL Genomics 21:150-159(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=21251004 ; PubMed=113 52567 ; 

RA Qiu Y . , Cavelier L., Chiu S., Yang X., Rubin E . , Cheng J.-F.; 

RT "Human and mouse ABCA1 comparative sequencing and transgenesis 

RT studies revealing novel regulatory sequences."; 

RL Genomics 73:66-76(2001). 

CC -!- FUNCTION: CAMP -DEPENDENT AND SULFONYLUREA- SENSITIVE ANION 

CC TRANSPORTER. KEY GATEKEEPER INFLUENCING INTRACELLULAR CHOLESTEROL 

CC TRANSPORT (BY SIMILARITY) . 

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED IN ADULT TISSUES. HIGHEST 
CC LEVELS ARE FOUND IN PREGNANT UTERUS AND UTERUS. 

CC -!- DOMAIN: MULTIFUNCTIONAL POLYPEPTIDE WITH TWO HOMOLOGOUS HALVES , 
CC EACH CONTAINING AN HYDROPHOBIC MEMBRANE -ANCHORING DOMAIN AND AN 

CC ATP BINDING CASSETTE (ABC) DOMAIN. 

CC -!- PTM: Phosphorylation on Ser-2054 regulates phospholipid efflux (By 
CC similarity) . 

CC -!- SIMILARITY: BELONGS TO THE ABC TRANSPORTER FAMILY. ABCA SUBFAMILY. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; X75926; CAA53530.1; ALT_INIT . 

DR EMBL; AF287263; AAG3 9073.1; ALT_INIT . 

DR MGD; MGI: 99607; Abcal . 

DR GO; GO =0008203; P : cholesterol metabolism; IDA. 

DR GO; GO: 0030301; P : cholesterol transport; IDA. 

DR InterPro; IPR003593; AAA_ATPase . 

DR InterPro; IPR003439; ABC_transporter . 

DR Pfam; PF00005; ABC_tran; 2. 

DR ProDom; PD000006; ABC_transporter ; 2. 

DR SMART; SM00382; AAA; 2. 

DR PROSITE; PS00211; ABC_TRANSPORTER_l ; 1. 

DR PROSITE; PS50893; ABC_TRANSPORTER_2 ; 2. 

KW ATP-binding; Glycoprotein; Transmembrane; Transport; Phosphorylation. 

FT TRANSMEM 26 4 2 POTENTIAL. 

FT TRANSMEM 64 0 656 POTENTIAL. 

FT TRANSMEM 690 706 POTENTIAL. 

FT TRANSMEM 717 733 POTENTIAL. 

FT TRANSMEM 74 9 765 POTENTIAL. 



FT 


TRANSMEM 


771 


787 


POTENTIAL 






FT 


TRANSMEM 


1041 


1057 


POTENTIAL 






FT 


TRANSMEM 


1351 


1367 


POTENTIAL 






FT 


TRANSMEM 


1661 


1677 


POTENTIAL 






FT 


TRANSMEM 


1708 


1724 


POTENTIAL 






FT 


TRANSMEM 


1737 


1753 


POTENTIAL 






FT 


TRANSMEM 


1775 


1791 


POTENTIAL 






FT 


TRANSMEM 


1854 


1870 


POTENTIAL 






FT 


NP BIND 


933 


940 


ATP (POTENTIAL) . 




FT 


NP_BIND 


1946 


1953 


ATP (POTENTIAL) . 




FT 


MOD_RES 


1042 


1042 


PHOSPHORYLATION (BY 


PKA) (MAJOR) (BY 


FT 








SIMILARITY) . 




FT 


MOD_RES 


2054 


2054 


PHOSPHORYLATION (BY 


PKA) (MAJOR) (BY 


FT 








SIMILARITY) . 




FT 


CARBOHYD 


14 


14 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


98 


98 


N- LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


151 


151 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


161 


161 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


196 


196 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


244 


244 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


292 


292 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


337 


337 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


349 


349 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


400 


400 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


478 


478 


N-LINKED 


(GLCNAC . . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


489 


489 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


521 


521 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


820 


820 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


1144 


1144 


N-LINKED 


(GLCNAC . . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


1294 


1294 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


1453 


1453 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


1499 


1499 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


1504 


1504 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


1637 


1637 


N-LINKED 


{GLCNAC. . 


.) (POTENTIAL) . 


FT 


CARBOHYD 


2044 


2044 


N-LINKED 


(GLCNAC. . 


. ) (POTENTIAL) . 


FT 


CARBOHYD 


2238 


2238 


N-LINKED 


(GLCNAC. . 


.) (POTENTIAL) . 


FT 


CONFLICT 


1567 


1568 


MISSING (IN REF. 2) . 




FT 


CONFLICT 


2024 


2024 


MISSING (IN REF. 2) . 




SQ 


SEQUENCE 


2261 


AA; 254011 


MW; FAE62B21FD1D09F9 


CRC64; 



Query Match 95.1%; Score 10906; DB 1; Length 2 261; 

Best Local Similarity 94.8%; Pred. No. 0; 

Matches 2087; Conservative 54; Mismatches 60; Indels 0; Gaps 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1 MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVARLFSDARRLLLYSQKDT 60 

II M I II II II II I M M II II 1 1 NNN IN INN II I NN N IN I 1 1 1 N Nil 

61 MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVSRLFSDAQRLLLYSQRDT 12 0 

61 SMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNLSLPKSTVDKMLRADVILH 120 

I - I I f I I I I hill : I I I I I I I I I I I I I I I I I I I I I I I I h II I I : I : : I I 

121 SIKDMHKVLRMLRQIKHPNSNLKLQDFLVDNETFSGFLQHNLSLPRSTVDSLLQXNVGLQ 180 

121 KVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPIL 180 

lllllllllll lllllll Ihlllll Ml MlllhM llllill llllllh: 
181 KVFLQGYQLHLASLCNGSKLEEIIQLGDAEVSALCGLPRKKLDAAERVLRYNMDILKPW 24 0 

181 RTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTQI 24 0 

Mill h = Mill II II! IMMM Ml II II IN I I II I I I II INI II 
241 TKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTKSWSDMRQEVMFLTNVNSSSSSTQI 300 

241 YQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNGTEEDAETFYDNSTTPYCNDLMK 300 

II Ml II 1 1 IMI I II Nil II 1 1 1 II II II MMM MM MIIMIIIIIIIIIM 

301 YQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNNTEEDVDTFYDNSTTPYCNDLMK 360 



301 



360 



NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 

IM IN 1 1 1 1 M 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 II 1 1! 1 1 Ml MINI 

361 NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 420 



Qy 

Db 



361 PKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQLDGLDWTAQDIVAFLAKHPEDVQSSNGS 420 

M 1 1 1 II M 1 1 M 1 1 1 1 1 1 M I M II IMM II MM I IMI II INI I II II III 

4 21 PQIWTFMENSQEMDLVRTLLDSRGNDQFWEQKLDGLDWTAQDIMAFLAKNPEDVQSPNGS 4 80 




Qy 


421 


VYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTG 


480 




MM 1 1 1 II II 1 1 1 1 M 1 MM MM 1 1 II II 1 IN 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


481 


t rv/nnTiirj T? t\ t?xTT?<T»'kt^>T\ TATTODCMDn/MT KTVT c D T DT C \ rD T TMVCMTTT T F^E 1 D V ITTaJ 7A f"* T WIT T 

V YTWKEArN&lNyAiy I J.oKr MhiL. VW LiNJN.ijEir'l c I £> vKLtXfJlv&rir*ijJ_iUnjKl\.r WrtkjA vr 


54 0 


Qy 


481 


ITPGSIELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDW 


54 0 




ill mini iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiiiiiiii 




Db 


541 


ITPDSVELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDW 


600 


Qy 


541 


EQA 1 1 R VLTGT E KKTG VYMQQM P Y P C YVDD I F LR VM SR SM P L FMT LAW I YS VA V 1 1 KG I V 


600 






II II 1 MM IMI 1 1 1 1 hi II MMIMI MIMI II II 1 1 1 IMI II 1 1 1 M 1 1 1 II 




Db 


601 


EQAIIRVLTGSEKKTGVxvQQMPYPCYVDDIr LiKVMbKbMrLir Ml LtAWl lovAvllKblv 


0 o u 


Qy 


601 


YEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 


660 




II II 1 1 III 1 1 1 1 1 1 1 1 1 1 III MIMI II II III II II 1 M II MM 1 M M II M II 




Db 


661 


YEKEARLKETMRIMGLDNGILWFSWFVSSLI PLLVSAGLljVVILJvJjtjNblj^iolJFo V vr V 


TOO 
f ZKJ 


Qy 


661 


FLSVFAWTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFAS 


720 






II !M 1 MM 1 1 1 1 1 1 II II 1 Ml 1 II 1! II II 1 MM II 1 M II MM 1 M MM IMI 




Db 


721 


FLSVFAMVTILQCFLISTLFSRANLAAACGGI I Yr TLYLPYVIjLVAWyDY vbr blKlr Ab 


/oU 


Qy 


721 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTW 


780 




Mill MM II M 1 1 1 II IMMMMM mIM M Mill 1 1 MM 1 II 1 MMI II M 




Db 


781 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFEbPVbbDCjrNijI 1 AVJbMMljr Ul r LiOVMl W 




Qy 


781 


YIEAVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVS 


84 0 






IIIIIIIIIIIIIIIMIIIIIIIIIIII lllllllhll UNI II II 1 IIM II 




Db 


841 


YIEAVFPGQYGIPRPWYFPCTKSYWr GEEIDEKfanPGbbOKOVbblUMbrjbFlnljKlj^vb 


y uu 


Qy 


841 


IQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYIL 


900 






IMIIIIIIIIIIIMIIIMIIIIIIIIIMIIIIIMIMIIIMIIIIIIIIIIIII 




Db 


901 


IQNLVKViRDGMKVAVDGliAijNr Ybbyi I br J_itjrtN(jA(jJ\l 1 IMblLiUjijr FrloblAilJj 


y b u 


Qy 


901 


GKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 


960 


Db 


961 


!|ii MMMM , M M I M M III M III! 1 MM 1! IIM III M Mi M II M M M 

GKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEnlWFYARL 


102 0 


Qy 


961 


LPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 


1020 






II 1 M, MIM II 1 1 1 1 1 III Ml IMIMI 1 MMMM M M III M M M II M M 




Db 


1021 


LP PSKLKS KTSQLSGGMQRKLS VALAF VGGbKV V 1 huh P I AO VDP YbKKQjIWbijijJjKiKy 


1080 


Qy 


1021 


GRTI ILSTHHMDEADVLGDRI AI ISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 


1080 






II III 1! 1 MM II 1 Ml II III M IMIMI M IMMM 1 II Ml MM MMI M M 




Db 


1081 


GRTIILSTHHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 


1140 


Qy 


1081 


SCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 


1140 






llllllllll M M 1 M II II II Ml Mill 1 MUM M I M II M 1 M Ml II II 1 1 




Db 


1141 


SCRNSSSTVSCLKKEDSVSQbbSDAGLGbDHEbDI LI 1U VbAibNLilKKii VbEiAKLiVbDl 


1200 


Qy 


1141 


GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 


1200 




III II MM IMI II II 1 II II 1 1 II 1 M IIM III MM 1 1 III MM M II Mill 1 




Db 


1201 


GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 


1260 


Qy 


1201 


TSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVK 


1260 






III II MM MM II III 1 II 1 lllllll 1 1 IMIMI 1 1 III MM M IMIMI M 




Db 


1261 


TSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 


1320 


Qy 


1261 


GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 


1320 






llllllllllllllll IIIIIIIIMMIIIIIIIIIIIIIIIIIIIIII1IIIIIII 




Db 


1321 


GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 


1380 


Qy 


1321 


PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTTAP 


1380 






IMI III IMMM II II II 1 1 II 1 MM M 1 II II 1 II 1 II II 1 1 1 IMIMI M 




Db 


1381 


PWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPDTPCLAGEEDWTISP 


1440 


Qy 


1381 


VPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDL 

M 1 M M II 1 1 II II 1 M 1 1 III 1 II II II II 1 II II 1 1 1 M III II 1 1 1 IMMM 


1440 


Db 


1441 


VPQS I VDLFQNGNWTMKNPS PACQCSSDKI KKMLPVCPPGAGGLPPPQRKQKTADILQNL 


1500 


Qy 


1441 


TGRNI SDYLVKTYVQI I AKSLKNKI WVNEFRYGGFSLGVSNTQALPPSQEVNDATKQMKK 


1500 






1 II 1 1 Mill III II II II 1 1 1 1 1 1 III MMMM IMIMI 1 II 1 1 Mill 1 1 1 1 1 





Db 


1501 


Qy 


1501 


Db 


1561 


Qy 


1561 


Db 


1621 


Qy 


1621 


Db 


1681 


Qy 


1681 


Db 


1741 


Qy 


1741 


Db 


1801 


Qy 


1801 


Db 


1861 


Qy 


1861 


Db 


1921 


Qy 


1921 


Db 


1981 


Qy 


1981 


Db 


2041 


Qy 


2041 


Db 


2101 


Qy 


2101 


Db 


2161 


Qy 


2161 


Db 


2221 



TGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNSQALPPSHEVNDAIKQMKK 1560 
HLKLAKDS SADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAI S S FLNVINNAI LRANLQK 1560 

III Ihlllllhllllll 1 1 1 h 1 1 1 1 1 1 1 II 1 1 1 1 INI 1 1 1 1 1 1 1 1 i II 1 1 1 1 1 

LLKLTKDTSADRFLS SLGRFMAGLDTKNNVKVWFNNKGWHAI S S FLNVINNAI LRANLQK 162 0 

GENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMSFVPASFWFLIQER 1620 

HIM I I II M I I I I I I II II II I II I I I I I I I I I I I I II I I I I I I I I I II M I I I 1 1 
GENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFWFLIQER 1680 

VSKAKHLQFISGVKPVI YWLSNFVWDMCNYWPATLVI I I FICFQQKSYVSSTNLPVLAL 1680 

Mill II IM M MM II II II III II I Ml IM II II I I IMIM I II II M II II I M 



LLLLYGWSITPLMYPASFVFKI P S T A YWLT S VN L F I G I NG S VAT F VLE L FTDN KLNN I N 174 0 

lllllllll'limilllllllllllllillllll !IIIIIII!IIImIMIIMI 

LLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATFVLELFTNNKLNDIN 1800 
DILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEG 1800 

Ml M 1 1 1 1 III 1 1 MM 1 1 1 M II II IMI I li II M II II II 1 1 1 II II IM Ml I M 

DILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEG 1860 
WF F L I T VL I Q YRF F I R P R P VN AKL S P LND E DED VRRE RQR I LDGGGQND ILEIKELTKI 1860 

Ml 1 1 1 1 1 Ml II I M M 1 1 1 III I II M I M II IMI IMM 1 1 II II Ml I M I M 

WFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRI LDGGGQND ILEIKELTKI 1920 
YRRKRKPAVDRI CVGI PPGECFGLLGVNGAGKS STFKMLTGDTTVTRGDAFLNRNS I LSN 1920 

I II M 1 1 Ml M MM MM Ml M II II M I Mill I II IM 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 I 

YRRKRKPAVDRICIGI PPGECFGLLGVNGAGKSTTFKMLTGDTPVTRGDAFLNKNS I LSN 1980 
IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEK 1980 

I MM II I II M II II M M I II MM II M M Ml M II M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGEWAIRKLGLVKYGEK 2 04 0 
YAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSWKEGRS WLT 204 0 

II II II I II II M II II M I M IM Ml M M M I II M 1 1 1! M Ml M II M M II I 

YASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSIVKEGRSWLT 2100 



MM MMM II 1 1 1 1 M II M IM MM M MMMI M liM Ml II M II IMM 1 1 



LAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2160 

lllllll M II I 1 1 Ml II I M M II I Ml I M 'I IMM! M I M II M MM MM I 

LAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2220 
AKDQSDDDHLKDLS LHKNQTWDVAVLTS FLQDEKVKE S YV 2201 

I II II M I M II IE II II II I II M IMMM I Ml II M I 

AKDQSDDDHLKDLS LHKNQTWDVAVLTS FLQDEKVKE SYV 22 61 



Sequence Comparison B 



RESULT 1 
ABC1_HUMAN 

ID ABC1_HUMAN STANDARD; PRT; 2261 AA. 

AC 095477; Q96S56; Q96T85; Q9NQV4 ; Q9UN06; Q9UN07; Q9UN08; Q9UN09; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE ATP-binding cassette, sub- family A, member 1 (ATP-binding cassette 

DE transporter 1) (ATP-binding cassette 1) (ABC-1) (Cholesterol efflux 

DE regulatory protein) . 

GN ABCA1 OR ABC1 OR CERP . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=20345099; PubMed=10884428 ; 

RA Santamarina-Fojo S., Peterson K.M., Knapper C.L., Qiu Y . , 

RA Freeman L.A., Cheng J.-F., Osorio J., Remaley A.T., Yang X.-P., 

RA Haudenschild C.C., Prades C, Chimini G. # Blackmon E.E., 

RA Francois T.L., Duverger N., Rubin E.M., Rosier M., Denefle P., 
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Db 


1861 


WFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 


1920 


Qy 


1861 


YRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNRNSILSN 


1920 






Mill II II Mil II II II II II II II M II II 1 1 1 1 II! II II MM 1 1 II Ml Mill 




Db 


1921 


YRRKRKPAVDRI CVG I P PGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNKNS I LSN 


1980 


Qy 


1921 


IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEK 


1980 






Mill II II IIIMI II II II II II II II II II 1 1 1 1 IM 1 1 1 1 II II 1 1 II II 1 Mill 




Db 


1981 


IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEK 


2040 


Qy 


1981 


Y AGNY S GGN KRKL S T AMAL I GG P P W F LD E PTTGMD P KARR F LWNC AL S WKEGR S WLT 


2040 






1 II II MM II II II MM li II II IMI M II 1 1 IM 1 1 Ml 1 II MM MM II II II 




Db 


2041 


YAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSWKEGRSWLT 


2100 


Qy 


2041 


SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQDFFG 


2100 






1 II II II II II MM II II II II II II Ml M 1 II II 1 1 1 II MM MM II llllll II 




Db 


2101 


SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQDFFG 


2160 


Qy 


2101 


LAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 


2160 






Mill 1 1 II Ml MM li 1 1 1 1 1 1 III 1 II 1 1 II 1 1 1 1 MM II II II 1 1 II 1 1 1 1 II II 




Db 


2161 


LAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 


2220 


Qy 


2161 


AKDQSDDDHLKDLS LHKNQTWDVAVLTS FLQDEKVKES YV 2201 





Db 2221 AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2261 



09786635 Results 
SEQ ID NO: 1 

SUMMARIES 

% 

Result Query 



No . 


Score 


Match 


Length 


DB 


ID 


Description 


1 


6880 


100, 


.0 


6880 


6 


AA2bj4bU 


AX253450 


Sequence 


2 


6880 


100. 


.0 


6880 


9 


HSA012376 


AJ012376 


Homo sapi 


3 


6880 


100, 


,0 


7260 


6 


AX2 534&2 


AX253452 


Sequence 


4 


6879 


100, 


.0 


9497 


6 


» vac n n n 

AX059973 


AX059973 


Sequence 


5 


6879 


100 , 


.0 


9497 


9 


TV CI ^ C 1 O 1 

Ar lob2bl 


AF165281 


Homo sapi 


6 


6875 . 8 


99, 


,9 


9741 


6 


AX127830 


AX127830 


Sequence 


7 


6875 . 8 


99. 


.9 


9741 


6 


AX139817 


AX139817 


Sequence 


8 


6875 . 8 


99, 


.9 


9741 


6 


AX351038 


AX351038 


Sequence 


9 


6875 . 8 


99. 


.9 


9854 


6 


AX127831 


AX127831 


Sequence 


10 


6875 . 8 


99. 


.9 


9854 


6 


AX13 9818 


AX139818 


Sequence 


11 


6870 . 4 


99, 


,9 


7860 


6 


*j\ A ft C ft A 

AX092594 


AX092594 


Sequence 


12 


6865 


99, 


,8 


9495 


6 


TV vn c n A "7 o 

AX05 9 9 78 


AX059978 


Sequence 


13 


6862.4 


99. 


.7 


i n a a ~> 


b 


nAUD V 1 J. J 


AX060713 


Sequence 


14 


6862.4 


99, 


, 7 


10442 


6 


AX060892 


AX060892 


Sequence 


15 


6862 .4 


99. 


. 7 


10442 


9 


AF285167 


AF285167 


Homo sapi 


16 


6860.8 


99. 


. 7 


10474 


6 


AX060719 


AX060719 


Sequence 


17 


6860.8 


99. 


. 7 


10474 


6 


AX060721 


AX060721 


Sequence 


18 


6860.8 


99. 


. 7 


10474 


6 


AX060898 


AX060898 


Sequence 


19 


6860.8 


99. 


,7 


10474 


6 


AX060900 


AX060900 


Sequence 


20 


6845.2 


99. 


.5 


7862 


6 


AX135712 


AX135712 


Sequence 


21 


6760.2 


98, 


,3 


9593 


6 


AX059976 


AX059976 


Sequence 


22 


6711.6 


97, 


,6 


6786 


9 


AB055982 


AB055982 


Homo sapi 


23 


5630 


81. 


.8 


7878 


10 


MMABC1 


X75 92 6 Mus musculu 


24 


5455 .6 


79. 


.3 


6801 


10 


AY208182 


AY208182 Rattus no 


25 


4302 


62 . 


.5 


7074 


5 


AF362377 


AF362377 


Gallus ga 


26 


2052 .4 


29 . 


.8 


5097 


6 


BD012346 


BD012346 


Genes rel 


27 


2052 .4 


29. 


.8 


5097 


6 


BD160225 


BD160225 


Primer fo 


28 


2052.4 


29. 


.8 


5097 


9 


AK027864 


AK027864 


Homo sapi 


29 


1911.6 


27. 


.8 


7298 


9 


AF001945 


AF0 0194 5 Homo sapi 



AF165281 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 

source 



AF165281 9497 bp mRNA linear PRI 17-AUG-1999 

Homo sapiens ATP cassette binding transporter 1 (ABC1) mRNA, 
complete cds. 
AF165281 

AF165281.1 GI: 5734100 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 9497) 

Rust,S., Rosier ,M., Funke,H., Real , J . , Amoura,Z., Piette,J,C, 

Deleuze, J . F . , Brewer, H.B., Duverger,N. , Denefle,P. and Assmann,G. 

Tangier disease is caused by mutations in the gene encoding 

ATP-binding cassette transporter 1 

Nat. Genet. 22 (4), 352-355 (1999) 

99364413 

10431238 

2 (bases 1 to 9497) 

Rust,S., Rosier,M., Funke,H., Real, J., Amoura,Z., Piette,J,C, 
Deleuze, J. F. , Brewer, H.B., Duverger,N., Denefle,P. and Assmann,G. 
Direct Submission 

Submitted (01- JUL-1999) Genomics, Rhone-Poulenc Rorer, 2 rue Gaston 
Cr#mieux, Evry 91006, France 

Location/Qualifiers 

1. .9497 

/organism= "Homo sapiens" 
/mol_t ype = " mRNA 11 



gene 
CDS 



BASE COUNT 
ORIGIN 



/db_xref ="taxon: 9606" 

/chromosome= " 9 11 

/map="9q31" 

1. .9497 

/gene= n ABCl M 

121. .6726 

/gene="ABCl M 

/note="ABC transporter; ABC1 protein" 
/codon_start=l 

/product="ATP cassette binding transporter 1" 
/protein_id= "AAD4 984 9.1" 
/db_xref="GI : 5734101" 

/ 1 rans 1 at ion= " MPS AGTLPWVQGI I CNANNPCFRYPTPGEAPG WGNFNKS I VAR 
LFSDARRLLLYSQKDTSMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNL 
SLPKSTVDKMLRADVILHKVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPRE 
KLAAAERVLRSNMDI LKP I LRTLNSTS PFPS KELAEATKTLLHS LGTLAQELFSMRSW 
SDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGG 
NGTEEDAETFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPATRQ 
VMAEVNKTFQELAVFHDLEGMWEELSPKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQ 
LDGLDWTAQDIVAFLAKHPEDVQSSNGSVYTWREAFNETNQAIRTISRFMECVNLNKL 
EPIATEVWLINKSMELLDERKFWAGIVFTGITPGSIELPHHVKYKIRMDIDNVERTNK 
IKDGYWDPGPRADPFEDMRYVWGGFAYLQDWEQAIIRVLTGTEKKTGVYMQQMPYPC 
YVDD I F LR VM S RS M PL FMT LAW I Y S VA V 1 1 KG I VY EKE ARLKE TMR I MG LDN S I LW FS 
WFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFVFLSVFAWTILQCFLISTLFSR 
ANLAAACGG 1 1 YFTLYLPYVLCVAWQDYVGFTLKI FAS LLS PVAFGFGCEYFALFEEQ 
GIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPWYF 
PCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVSIQNLVKVYRDGMKVAV 
DGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSTIRQN 
LGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPSSKLKSKTSQ 
LSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQGRTIILSTHH 
MDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSSST 
VSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHELTY 
VLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAETSDG 
TLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVKGW 
KLTQQQ F VAL LWKRLL I ARR S R KG F FAQ I VL P A VF VC I ALVF S LIVPPFGKYPSLELQ 
PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTT 
APVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADI 
LQDLTGRNI SDYLVKTYVQI I AKSLKNKI WVNEFRYGGFSLGVSNTQALPPSQEVNDA 
TKQMKKHLKIiAKDSSADRFIiNSI^RFMTGLDTRNNVKWFNNKGWHAISSFLNVINNA 
ILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMSFVPA 
SFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQKS 
YVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 
VLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPL 
SWDLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRI 
LDGGGQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLT 
GDTTVTRGDAFLNRNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPE 
KEVGKVGEWAIRKLGLVKYGEKYAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMD 
PKARRFLWNCALSWKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNR 
FGDGYTIWRIAGSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFS 
ILSQSKKRLHIEDYSVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSF 
LQDEKVKESYV" 

2600 a 2115 c 2217 g 2564 t 1 others 



Query Match 100.0%; Score 6879; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 6879; Conservative 0; Mismatches 



DB 9; Length 94 97; 

1 ; Indels 0 ; Gaps 



0; 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 



CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 6 0 

!l II il II 1 1 III M II INI il Mil M 1 1 1 il II II M Mil I i II M 1 1 1 1 1 II IN 

CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTATTTATCTTCCTGATCCTGATC 6 0 

TCTGTTCGGCTGAGCTACCCACCCTATGAACAACATGAATGCCATTTTCCAAATAAAGCC 120 

II II Mill 1 1 II II II MIMMIMMIMM II MM II II II Ml I I M M MM I 

TCTGTTCGGCTGAGCTACCCACCCTATGAACAACATGAATGCCATTTTCCAAATAAAGCC 120 

ATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACCCC 180 

1 1 1 1 1 II MM 1 1 III 1 1 II Mill 1 1 1 M II 1 1 MM 1 1 1 1 1 MM I M M I M III 1 1 

ATG CCCT CTG C AGG AAC ACTT CCTTGGGTTC AGGGG ATTATCTGT AATG CCAAC AACC C C 180 

TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 24 0 



MM llllllll IIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIII 

Db 181 TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 240 

Qy 241 ATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCTTCTTTTATACAGCCAGAAAGACACC 3 00 

MM MMMI MMI I MIMMIMMMMMMIM MIMMIM MIM MM 

Db 241 ATTGTGG CT CG CCTGTT CT C AG ATGCTCGG AGGCTTCTTTT AT AC AG C C AG AAAG AC ACC 300 

Qy 301 AG C ATG AAGG AC ATGCG C AAAGTTCTG AG AAC ATT AC AG C AG ATC AAG AAAT CC AG CTC A 360 

1 1! ! II II II 1 1 1 1 1 1 I ! 1 1 II II II I MM 1 1 1 II II ! I! II II 1 1 lil II 1 1 1 II II I 

Db 301 AG C ATG AAGG AC ATG CG C AAAGTTCTG AG AAC ATT AC AG C AG ATC AAG AAAT CC AG CTC A 360 

Qy 361 AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 42 0 

1 1 1 i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 361 AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 42 0 

Qy 421 AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 4 80 

I II 1 1 1 II IN M II Ml M M II II I II II I M II M MIM II I M 1 1 II M I II II I 

Db 421 AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 4 80 

Qy 481 AAGGT ATTTTTG C AAGG CT AC C AGTT AC ATTTG AC AAGT CTGTG C AATGG AT C AAAAT C A 54 0 

IM I II IM M I I M I I M Ml II 1 1 M II I' M MM I II II II M Ml M M Ml IM 

Db 4 81 AAGG T ATTTTTG C AAGG C T ACC AGTT AC ATTTG AC AAGTCTGTG C AATGG AT C AAAATC A 54 0 

Qy 541 GAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 600 

I II I M II M 1 1 MMM MM I! I ! M IM M M 1 1 II II II 'MMI I M M II IM 

Db 541 GAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 600 

Qy 601 AAACTGGCTG C AG C AG AG CG AG T ACTT CG TT CC AAC ATGG AC ATCCTG AAG C C AATCC TG 660 

I II M I M II I M II II M M I II I M M il M I MM I II il M I M M M M Ml M I 

Db 601 AAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCAACATGGACATCCTGAAGCCAATCCTG 660 

Qy 661 AG AACACT AAACT CT AC AT CT C C CTT CC CG AG C AAGG AG CTGG C CG AAG C C AC AAAAAC A 720 

MMMMMMIMMMMIMMMIMMMMIMMIIIIIIIIIMMIMM 

Db 661 AGAACACTAAACTCTACATCTCCCTTCCCGAGCAAGGAGCTGGCCGAAGCCACAAAAACA 720 

Qy 721 TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 780 

M I Mi II II I II II II I M I I II 1 1 1 M II II MMI I II II II I MM I M I MMM 

Db 721 T TGCTG C AT AGTCTTGGG ACT C TGG CC C AGG AG CTG TT C AGC ATG AG AAGCTGG AG TG AC 780 

Qy 781 ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 84 0 

I 1 1 1 IMMI I M M 1 1 1 1 1 MM 1 1 1 II M M I MM I II II M MMM| MM 1 1 1 1 

Db 781 ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 84 0 

Qy 841 T AC C AGGC TGTG TCTCG T ATTG T CTGCGGG C AT CCCG AGGGAGGGGGG CTG AAG AT C AAG 900 

M 1 1 II M II 1 1 1 1 1 1 M M II M 1 1 1 II M 1 1 1 MM Ml II M Mi M II I M M I M 

Db 841 T AC CAGGCTGTGTCTCGTATTGT CTGCGGG CAT CCCG AGGGAGGGGGG CTG AAG AT C AAG 900 

Qy 901 TCT C TC AAC TGGT ATG AGG ACAACAACT AC AAAGC C CT CTTTGG AGGC AATGG C ACTG AG 960 

I I M II II MM M MIM M MM M Ml II II M MM M ;l M I II II il 1 1 1 II M I 

Db 901 TCTCTCAACTGGTATGAGGACAACAACTACAAAGCCCTCTTTGGAGGCAATGGCACTGAG 960 

Qy 961 GAAGATGCTGAAACCTTCTATGACAACTCTACAACTCCTTACTGCAATGATTTGATGAAG 1020 

1 1 1 1 M II 1 1 MMMI I M M II II M MM IMIMI M II II III II 1 1 1 MM II I 

Db 961 G AAG ATGCTG AAACCT TC T ATG AC AACT CT AC AACT C CTT ACTGC AATGATTTG ATG AAG 1020 

Qy 1021 AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 1080 

M 1 1 MM 1 1 1 II II II I MM IMMI MM II Mill II II IMM II Ml M II II I 

Db 1021 AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 1080 

Qy 1081 GGG AAG AT C C TGT AT AC ACCTG ACAC TCC AGC C ACAAGG C AGG TC ATGGC TG AGGTG AAC 114 0 

1 1 M II II 1 1 1 II M IM M II M 1 1 1 II II 1 1! II II I II IMM IMMI I M M II I 

Db 1081 GGG AAG ATC CTG T AT AC AC CTG AC ACTC C AGCC AC AAGG C AGGT C ATGGCTG AGGTG AAC 114 0 

Qy 1141 AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 1200 

II 1 1 II 1 1 1 1 1 MM 1 1 1 M MM I II IM II MM IMM II II Mill Ml II II M I 

Db 1141 AAGACCTTCCAGGAACTGGCTGTGTTCCATGATCTGGAAGGCATGTGGGAGGAACTCAGC 1200 

Qy 1201 CCC AAG ATCTGGACCTTC ATGG AG AAC AGCC AAG AAATGGACCTTGTCCGG ATG CTGTTG 1260 

1 1 MINI llllllllllll.il IMIMI III II II MINIUM MINI III INI I 

Db 1201 CCCAAGATCTGGACCTTCATGGAGAACAGCCAAGAAATGGACCTTGTCCGGATGCTGTTG 1260 







• • 




Qy 


1261 


GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 


1320 




1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

1 I 1 1 1 1 1 II II II 1 1 II II 1 II II 1 II II II 1 1 1 1 1 M M 1 II M M II II II 1 1 1 1 M 1 




Db 


1261 


GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 


1320 


Qy 


1321 


CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 


1380 




I 1 1 1 1 l II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i t 

II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


1321 


CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 


1380 


Qy 


1381 


GTGT AC AC CTGG AG AG AAG CTTT C AACG AG ACT AAC C AGG C AAT CCGG ACC AT AT CTCG C 


1440 




i t i i i i l i l l l l l i l l l l 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 
1 1 I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1381 


GTGT AC AC CTGG AG AG AAG CTTT C AACG AG AC T AAC C AGG C AAT CCGG ACC AT ATCTCG C 


1440 


Qy 


1441 


TTC ATGG AGTGTGTC AAC CTG AAC AAGCT AG AACC C AT AG C AAC AG AAGTCTGG CT C AT C 


1500 




1 1 1 1 1 1 I 1 1 1 I I I I ! 1 I I I I I I 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 
1 I I 1 I I II 1 II 1 1 1 1 i 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 




Db 


1441 


TTC ATGG AGTG TGTC AAC CTG AAC AAG CT AG AAC C CAT AG C AAC AG AAGT CTGG CT CATC 


1500 


Qy 


1501 


AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 


1560 




1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t t 1 1 1 1 1 1 1 1 ! 1 1 ! t 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 




Db 


1501 


AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 


1560 


Qy 


1561 


ATT ACT C C AGGC AG C ATTG AG CTG CCCC AT C ATGT C AAGT AC AAG ATC CG AATGG AC ATT 
l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 11 1 1 1 t 1 t 1 1 1 1 I 1 1 


1620 


Db 


1561 


1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 I U 1 1 1 1 1 1 1 
ATT ACT C C AGG CAG C ATTG AG CTG CCCC AT C ATGT CAAGTAC AAG ATCCG AATGG AC ATT 


1620 


Qy 


1621 


GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 


1680 




1 \ 1 t 1 1 1 1 I 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 

I 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 ! 1 1 1 1 I 1 1 1 1 1 1 II 1 




Db 


1621 


GACAATGTGGAGAGGACAAATAAAATCAAGGATGGGTACTGGGACCCTGGTCCTCGAGCT 


1680 


Qy 


1681 


GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 


1740 




i 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 t 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 I I I I I 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I 1 II 1 II 1 1 1 1 1 1 1 1 1 1 




Db 


1681 


GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 


1740 


Qy 


1741 


GAG C AGG C AATC AT C AGGGTG C TG ACGGG CAC CG AG AAGAAAAC TGG TGT CT AT ATGC AA 


1800 




t 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


1741 


G AG C AGGC AATC ATC AGGGTG CTG ACGGG CAC CG AG AAG AAAACTGGTGTCT AT ATGCAA 


1800 


Qy 


1801 


CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 


1860 




f 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 

! 1 1 1 1 1 11 1 1 1 1 1 1 ! 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 




Db 


1801 


CAGATGCCCTATCCCTGTTACGTTGATGACATCTTTCTGCGGGTGATGAGCCGGTCAATG 


1860 


Qy 


1861 


CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 


1920 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 




Db 


1861 


CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 


1920 


Qy 


1921 


T ATG AG AAGG AGGC ACGG CTG AAAG AG AC C ATGCGG AT CATGGG C CTGG AC AAC AGC AT C 


1980 




1 1 I t 1 I 1 t 1 t 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 t 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I 1 1 




Db 


1921 


T ATG AG AAGG AGGC ACGG CTG AAAG AG AC C ATGCGG AT CATGGGC CTGG AC AAC AG CAT C 


1980 


Qy 


1981 


CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 


2040 




1 1 I 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 ! I I 1 1 1 1 1 1 II It II 1 1 1 1 1 1 1 
1 1 I II 1 II 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 i 1 II 1 1 1 1 1 1 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1981 


CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 


2040 


Qy 


2041 


GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 


2100 




1 1 1 1 1 1 1 t 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2041 


GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 


2100 


Qy 


2101 


TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 


2160 




1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 t 1 
1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2101 


TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 


2160 


Qy 


2161 


TCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 


2220 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 ! II 1 1 1 1 1 1 
1 1 1 1 II II I ! 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2161 


TCC AG AGC C AACCTGG CAG C AGCCTGTGGGGG CATC AT CT AC TTC ACGCTGTAC CTG CCC 


2220 


Qy 


2221 


TACGTCCTGTGTGTGGCATGGCAGGACTACGTGGGCTTCACACTCAAGATCTTCGCTAGC 


2280 




I I I I I I I I I I I I I I I I I I I I I I I I 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


2221 


TACGTCCTGTGTGTGGCATGGCAGGACTACGTGGGCTTCACACTCAAGATCTTCGCTAGC 


2280 


Qy 


2281 


CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 


2340 




1 1 III 1 1 1 II II II II II 1 1 III MM IMM 1 1 IMIIIII II 1 1 1 II MM II II 1 II 




Db 


2281 


CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 


2340 




Qy 


2341 


GGCATTGGAGTGCAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 


2400 




MIMMIMMMMMIMMMMMMIMMMMMMI MMIIMIMM! 




Db 


2341 


GGCATTGGAGTGCAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 


24 00 


Qy 


2401 


CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 


2460 




MMMM MMMMIIIMIIMIMMMMMIMMIMMIMIIMMMM 




Db 


2401 


CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 


2460 


Qy 


2461 


TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 


2520 




MMMM MMMMM IMIIMIIMMIMMIMMIMIIIMI MMMMM 




Db 


2461 


TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 


2520 


Qy 


2521 


ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 


2580 




MIIMMIMMMMI IMIIMIMMMMMMMIIMIIMMMMIMM 




Db 


2521 


ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 


2580 


Qy 


2581 


AAG AG AAT ATC AG AAAT CTGC ATGG AGG AGG AACCC AC C C ACTTG AAG CTGGG CGTG TC C 


2640 




I im ! I imi ! II II 1 1 ! 1 ! II II 1 1 1 II M 1 1 1 1: M < : I i:m 1 : 1 1; M 1 1 1 1 i M I 




Db 


2581 


AAG AG AAT ATC AG AAAT CTGC ATGG AGG AGG AAC C CACC C ACTTG AAGCTGGG CG TG TCC 


264 0 


Qy 


2641 


ATT C AG AAC CTGGT AAAAGT CT ACCG AG ATGGG ATG AAGGTGG C TGT CG ATGG C CTGGC A 


2 700 




MMMMMMI MMMMMMMMMMMMMIMMMMMMM IMM 




Db 


2641 


ATT C AG AACCTGGT AAAAGTCT AC CG AG ATGGG ATG AAGGTGG C TGT CG ATGGC CTGGC A 


2 700 


Qy 


2701 


CTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATGGAGCGGGGAAGACG 


2760 




MMMMMMI MM MIMMMMMMMMMMMMMMIIMM IMM 




Db 


2701 


CTGAATTTTTATGAGGGCCAGATCACCTCCTTCCTGGGCCACAATGGAGCGGGGAAGACG 


2760 


Qy 


2761 


ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 


2820 


Db 


2761 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 


2820 


Qy 


2821 


GGAAAAGACATTCGCTCTGAGATGAGCACCATCCGGCAGAACCTGGGGGTCTGTCCCCAG 


2880 




IMMMMMMMM MMMMMMMi iMMMM MM MMMMM MM 




Db 


2821 


GG AAAAG AC ATT CG C TCTG AG ATG AGC AC CAT C CGG C AG AACCTGGGGGT CTGT C C C C AG 


2880 


Qy 


2881 


C AT AACGTG CTG T TTG AC ATGCTG ACTGT CG AAG AAC AC ATCTGGTTCT ATG C C CGCTTG 


2940 




IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIilllllllMIIIIIIIIIIIIIIIII 




Db 


2881 


C ATAACGTGCTGTTTG AC ATGCTG ACTGT CG AAG AAC AC ATCTGGTTCT ATG CC CGCTTG 


2940 


Qy 


2941 


AAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCAGATGGCCCTGGATGTTGGT 


3000 




IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


2941 


AAAGGGCTCTCTGAGAAGCACGTGAAGGCGGAGATGGAGCAGATGGCCCTGGATGTTGGT 


3000 


Qy 


3001 


TTGC CATC AAG C AAG C TG AAAAGC AAAAC AAG C C AG CTGT C AGGTGG AATG C AG AG AAAG 


3060 




I II 1 II 1 I II 1 ll II II 1 1 1 II II 1 1 1 II II M I II h MM | | ! ! | | l| < | | 1 | || || | 




Db 


3001 


TTGC CATC AAG C AAG C TG AAAAGC AAAAC AAG CCAGCTGT C AGGTGG AATG C AG AG AAAG 


3060 


Qy 


3061 


CTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTCTGGATGAACCCACA 


3120 




MM MM MMMMM MIMMM MM MMMMMMMMMM Mi MM M 




Db 


3061 


CTATCTGTGGCCTTGGCCTTTGTCGGGGGATCTAAGGTTGTCATTCTGGATGAACCCACA 


3120 


Qy 


3121 


G CTGGTGTGG ACC C TT ACTC C CGCAGGGG AAT ATGGG AG CTG CTG C TG AAAT AC CG AC AA 


3180 




Ml 1 II IMM II II II M 1 M III M MIMM II II ill M II 1 M M M M 1 1 1 1 1 1 




Db 


3121 


G C TGGTGTGG ACC CTT ACT C C CGCAGGGG AAT ATGGG AGCTG CTG C TG AAAT AC CG AC AA 


3180 


Qy 


3181 


GG CCG C ACC ATT ATT CTCT CT AC AC ACC AC ATGG ATG AAGCGG ACG TCC TGGGGG AC AGG 


3240 




Ml 1 II II II Ml II II M 1 M II 1 M II IM M II IMM MM 1 M 1 1 1! M 1 1 III 1 




Db 


3181 


GGCCGCACCATTATTCTCTCTACACACCACATGGATGAAGCGGACG TCC TGGGGG AC AGG 


3240 


Qy 


3241 


ATTGC C ATC AT CT CCC ATGGG AAG CTGTGCTGTGTGGGCTCCTCCC TGT TT CTG AAG AAC 


3300 




Ml 1 MMM 1 II MM M 1 Mill M II II Mill II Ml 1! II 1 M 1 1 II M 1 1 MM 




Db 


3241 


ATTGC C ATC ATCTCCC ATGGG AAG CTGTGCTGTGTGGGCTCCTCCCTGTTT CTG AAG AAC 


3300 


Qy 


3301 


C AG CTGGG AAC AGG CT ACT AC CTG ACCTTGGTC AAG AAAG ATGTGG AAT C CT C CC TC AGT 


3360 




1 Ml II 1! IMM 1 MM M Mill M MIMM MIMM II IMM Mill M II II 1 




Db 


3301 


C AG C TGGG AAC AGG CT ACT ACCTG ACCTTGGTC AAG AAAG ATGTGG AATC CT C CCTC AGT 


3360 


Qy 


3361 


TCCTG C AG AAAC AGT AGT AG C AC TGTGT C AT AC CTG AAAAAGG AGG AC AGTGTTTCTC AG 


3420 



Ml MMMMMMI MMM MIMMMMMI MMMMM MIM MM MM 



Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 


Db 


3901 


Qy 


3961 


Db 


3961 


Qy 


4021 


Db 


4021 


Qy 


4081 


Db 


4081 


Qy 


4141 


Db 


4141 


Qy 


4201 


Db 


4201 


Qy 


4261 


Db 


4261 


Qy 


4321 


Db 


4321 


Qy 


4381 


Db 


4381 


Qy 


4441 



TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 3420 
AG C AGTTCTG ATG CTGG C CTGGG C AG CG AC C ATG AG AGTGACACG CTG AC C ATCG ATGTC 34 80 

INI III llllllll llllllll MMIIIIMIIIII MM MM Ml IMlllll 1 1 

AGC AGTT CTG ATGCTGG CCTGGG C AG CG AC C ATG AG AGTG AC ACGCTG AC C AT CG ATGTC 34 80 
TCTGCTATCTCCAACCTCATCAGG AAGC ATGTGTCTGAAGCCCGGCTGGTGGAAGACATA 354 0 

1 1 1 111 II I 1 1 1 1 1 1 1 1 1 1 i! 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 Mi I 

TCTGCTATCTCCAACCTCATCAGG AAGCATGTGTCTGAAGCCCGGCTGGTGGAAG ACATA 354 0 
GGG C ATG AG CTG AC CT ATGTG CTG C C AT ATG AAG CTGCT AAGG AGGG AG CCTTTGTGG AA 3600 

lllllllllll MMMMMIIMMMMIMMMMMMIIMMIMIIMM 

GGG C ATG AG CTG AC CT ATG TG CTG CC AT ATG AAG CTGCT AAGG AGGG AG C CTTTG TGG AA 3600 
CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 3660 

IIIIIIIIIIIMIIIIIMIIIIIIIIIII IMMIMMIMMMMMII illlll 

CTCTTTCATGAGATTGATGACCGGCTCTCAGACCTGGGCATTTCTAGTTATGGCATCTCA 3660 
GAG ACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTGGGGTGG ATGCTGAG 3720 

I M M II I I I II II II M M IMMM II M MM II M M MM MMI I Ml IM I II 

GAGACGACCCTGGAAGAAATATTCCTC AAGGTGGCCGAAGAGAGTGGGGTGGATGCTGAG 372 0 
ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 3780 

I M MM I I I II M I ! I I II M M II I II II MIMI M M.MM- M I I Ml Ml I II 

ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 3780 
TGTCTT CG C CCGTTC ACTGAAG ATG ATG C TG CTG ATCCAAATG ATT CTG AC AT AG ACC C A 3840 

I II II IM M 1 1 1 1 1 1 Mil M M >l Ml II II II II M M M M MM: I M I II M II 

TGTCTTCGC C CGTTC ACTGAAG ATG ATG CTG C TG ATC C AAATG ATT CTG AC AT AGAC CCA 3840 
GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 3 900 

!!l II II I M I I M M II M IM II 1 1 MMM I! M M II II II MM I i I M M IMi 

GAATCCAGAGAGACAGACTTGCTCAGTGGGATGGATGGCAAAGGGTCCTACCAGGTGAAA 3 900 
GGCTGGAAACTTACACAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 3 960 

I M II II MM I II II IMMI M I 1 1 - M M II M 'Ml II II II II M I IMI I! M 

GG CTGG AAAC TT AC AC AG C AAC AGTTTGTGGC CCTT TTGTGG AAG AG ACTG C T AATTGC C 3960 
AGACGGAGTCGGAAAGGATTTTTTGCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 4 020 

IMu || 1 1 1 ! I M M M II M II II M I M II IM! II IMI M M II II Ml M M II 

AGACGGAGTCGGAAAGGATTTTTTGCTCAGATTGTCTTGCCAGCTGTGTTTGTCTGCATT 4 02 0 
GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 4 080 

I II MM 1 1 1 1 MMI M ,1 IMI M M I IMMMI II M M IMMMI I M MMM 

GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 4 080 
C CCTGG ATG T AC AACG AACAGT AC AC ATTTGT C AG C AATG ATG CT C CTG AGG AC ACGGG A 414 0 

I M M Ml II I I M IM 'M I! M M I I I II M IMI II MMMMM II I I Ml I M I 

C C CTGG ATG T AC AACG AACAGT AC AC AT TTGT C AGC AATGATG CT C CTG AGG AC ACGGG A 414 0 
ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 4200 

I II II IM 1 1 1 1 II I 1 1 Ml II I I IM I I II M ll II II MM'MMMI 1 1 Ml IM I 

ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 4200 
GG AAAC C C AATCCC AG AC ACGC C CTG CC AGGC AGGGG AGGAAG AGTGG AC C ACTG C C C C A 4 260 

I II II II 1 1 M 1 1 1 1 1 1 1 M II 1 1 MM 1 1 1 II 1 1 1 1 II I II MMM IM 1 1 M 1 1 Ml 

GG AAAC C C AATCC C AG AC ACGCC C TG C C AGGC AGGGG AGGAAG AGTGG AC C ACTG CCC C A 4260 
GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 4320 

IM II II MM 1 1 1 II II II M IMI 1 1 1 M M II II II II II II MMMM M I II II 

GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 4320 
C CTG C ATG C C AG TGT AGC AGCG AC AAAATC AAG AAG ATG CTGCCTGTGTGT C CCCC AGGG 4380 

Ml II II II M I M II II II IMI M M Ml II II II MM II II IMI IMM M II II 

CCTG C ATG C C AG TG T AGC AGCG AC AAAATC AAG AAG ATG CTGCCTGTGTGT CC CCC AGGG 4380 

GCAGGGGGGCTGCCTCCTCCACAAAGAAAACAAAACACTGCAGATATCCTTCAGGACCTG 444 0 

III II II II I I III IMI II II II II II I II II II Illlll II II II II II I I I II MM 

G C AGGGGGGCTG C CTC CT CC AC AAAG AAAAC AAAACACTGC AG AT ATC CTTC AGG AC CTG 444 0 

AC AGG AAG AAAC ATTTCGG ATT ATCTGG TG AAG ACGT ATGTGC AG AT CAT AG C C AAAAG C 4 500 



Db 


4441 


Qy 


4501 


Db 


4501 


Qy 


4561 


Db 


4561 


Qy 


4621 


Db 


4621 


Qy 


4681 


Db 


4681 


Qy 


4741 


Db 


4741 


Qy 


4801 


Db 


4801 


Qy 


4861 


Db 


4861 


Qy 


4921 


Db 


4921 


Qy 


4981 


Db 


4981 


Qy 


5041 


Db 


5041 


Qy 


5101 


Db 


5101 


Qy 


5161 


Db 


5161 


Qy 


5221 


Db 


5221 


Qy 


5281 


Db 


5281 


Qy 


5341 


Db 


5341 


Qy 


5401 


Db 


5401 


Qy 


5461 


Db 


5461 



lllllllllllllllllllllllllllllllllllllllllilllllll IIMIIIIM 



TT AAAG AAC AAG ATCTGGGTG AATG AGTTT AGGT ATGG CGG CTTTTC CCTGGGTGTC AGT 4560 

IMIIIMIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIMIMIIIII IMIIIIII 

TTAAAGAACAAGATCTGGGTGAATGAGTTTAGGTATGG CGG CTTTTC CCTGGGTGTC AGT 4560 
AAT ACTC AAG C ACTTCCT CCG AGT C AAG AAGTT AATG ATG CC AC C AAAC AAATG AAG AAA 4620 

II II 1 1 II 1 1 1 1 1 II II I IIMIIMIII 1 1 1 1 1 II 1 1 1! Ill Ml !l INI II 1 1 1 II I 

AAT ACTC AAG CACTTCCTC CG AGT C AAG AAGTT AATG ATG CC AC C AAAC AAATG AAG AAA 4620 
CACCTAAAGCTGGCCAAGGACAGTTCTGCAGATCGATTTCTCAACAGCTTGGGAAGATTT 4 680 

II II II II 1 1 1 1 1 II III IMliH II II 1 1 1 1 Mill li MM M II II I I II 1 1 Ml I 

CACCTAAAGCTGGCCAAGGACAGTTCTGCAGATCGATTTCTCAACAGCTTGGGAAGATTT 4 680 
ATGACAGGACTGGACACCAGAAATAATGTCAAGGTGTGGTTCAATAACAAGGGCTGGCAT 4 740 

II Ml MM M M Mill MMM IMIMI MMM M M II MM Mill I MM i iM 

ATGACAGGACTGGACACCAGAAATAATGTCAAGGTGTGGTTCAATAACAAGGGCTGGCAT 4740 
GC AAT C AG CT CTTT C CTG AATGT CAT C AAC AATG CC ATT CTC CGGGC C AAC CTG C AAAAG 4 800 

iiiiiiiiiiiiiiiiiiiimmiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

GCAATCAGCTCTTTCCTGAATGTCATCAACAATGCCATTCTCCGGGCCAACCTGCAAAAG 48 00 
GG AG AG AAC C CT AG C C ATT ATGG AATTACTG CTTT C AAT C AT CC C CTG AATC TC AC C AAG 4860 

i I M M IM . I M mM iMMI II II II I M M h M I I II MM I I M M MM Ml I 

GG AG AG AACC CT AG C C ATT ATGG AATTACTG CTTT C AAT C AT CC CCTG AATC TC AC C AAG 4860 
C AG C AG CT CT C AG AGGTGGCT CCG ATG AC C AC ATC AGTGGATGT CCTTG TGTC C AT CTG T 4920 

iiiMiiiiiiiimiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

CAGCAGCTCTCAGAGGTGGCTCCGATGACCACATCAGTGGATGTCCTTGTGTCCATCTGT 4 920 
GTCATCTTTGCAATGTCCTTCGTCCCAGCCAGCTTTGTCGTATTCCTGATCCAGGAGCGG 4980 

iiimiiimiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiimii 

GTCATCTTTGCAATGTCCTTCGTCCCAGCCAGCTTTGTCGTATTCCTGATCCAGGAGCGG 4980 
GT C AGC AAAG C AAAAC ACC TG CAGTTC AT C AGTGG AGTG AAG CC TGT C ATCT ACTGG C TC 504 0 

IM; il IMM I II 1 1 1 li II II II IM.nM I li II 1 1 1 1 II M M 1 1 1| |i 1 1 Ml I 

GT C AGC AAAG C AAAAC ACC TG CAGTTC ATC AGTGG AGTG AAG CCTGTC ATCT ACTGG CTC 5040 
TCTAATTTTGTCTGGGATATGTGCAATTACGTTGTCCCTGCCACACTGGTCATTATCATC 5100 

IM! II IMM I Ml M M MM II IM M M 1 1 II II M M M Mi 1 1 1 M M 1 1 1 M I 

TCTAATTTTGTCTGGGATATGTGCAATTACGTTGTCCCTGCCACACTGGTCATTATCATC 5100 
TTCATCTGCTTCCAGCAGAAGTCCTATGTGTCCTCCACCAATCTGCCTGTGCTAGCCCTT 5160 

II MM MM I MM II II II M II II I M !M I II II M II II I M 1 1 1 II M I M II I 

TTCATCTGCTTCCAGCAGAAGTCCTATGTGTCCTCCACCAATCTGCCTGTGCTAGCCCTT 5160 
CTACTTTTGCTGTATGGGTGGTCAATCACACCTCTCATGTACCCAGCCTCCTTTGTGTTC 5220 

II I I II IMI 1 1 1 i I II II II II II II Mi <M I II II MM M II I I IMI M M I M I 

CTACTTTTGCTGTATGGGTGGTCAATCACACCTCTCATGTACCCAGCCTCCTTTGTGTTC 52 20 
AAGATCCCCAGCACAGCCTATGTGGTGCTCACCAGCGTGAACCTCTTCATTGGCATTAAT 5280 

II 1 1 II 1 1 II M M I II II MM II II M II I M II II MM IMI M IM ill M I II I 

AAGATCCCCAGCACAGCCTATGTGGTGCTCACCAGCGTGAACCTCTTCATTGGCATTAAT 52 80 
GGCAGCGTGGCCACCTTTGTGCTGGAGCTGTTCACCGACAATAAGCTGAATAATATCAAT 5340 

II I I II I I II 1 1 1 1 1 II II II M II M M II 1 1 1 II II M II II M II M IM 1 1 1 1 II I 

GGC AGCGTGG CC ACCTTTGTG CTGG AG CTGTT C ACCG AC AAT AAGCTG AAT AAT ATC AAT 5340 
GATATCCTGAAGTCCGTGTTCTTGATCTTCCCACATTTTTGCCTGGGACGAGGGCTCATC 54 00 

I IMM II il I I I IMI I I MMM II II M Ml II II II II IMMI II II I I I I IM I 

GATATCCTGAAGTCCGTGTTCTTGATCTTCCCACATTTTTGCCTGGGACGAGGGCTCATC 54 00 
G AC ATGGTG AAAAAC C AGGC AATGGCTG ATG C C CTGG AAAGGTTTGGGG AG AATCG CTTT 54 60 

1 1 II II II 1 1 1 M M II 1 1 II II II II II MMI II II II 1 1 1 1 II M i 1 1 1 1 1 1 Mill 

GACATGGTGAAAAACCAGGCAATGGCTGATGCC CTGG AAAGGTTTGGGG AG AATCG CTTT 54 60 
GTG T C ACC ATT ATC TTGGGACTTGGTGGG ACGAAACCTCTTCGC C ATGG CCGTGGAAGGG 5520 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

GTGTCACCATTATCTTGGGACTTGGTGGGACGAAACCTCTTCGCCATGGCCGTGGAAGGG 5520 



Qy 


5521 


GTGGTGTTCTTCCTCATTACTGTTCTGATCCAGTACAGATTCTTCATCAGGCCCAGACCT 


5580 




illil II li II II 1 1 1 Mil INN II II II 1 1 Mill III Mil II II 1 M II II II II 




Db 


5521 


G TGGTG TTCTT C CTC ATT ACTGTTCTG ATC C AGTAC AG ATT CTTC AT CAGG CCC AG AC CT 


5580 


Qy 


5581 


GTAAATGCAAAGCTATCTCCTCTGAATGATGAAGATGAAGATGTGAGGCGGGAAAGACAG 


5640 




1 1 II Mill II IM M MM IM II llll II M MM! Ill IMI II II 1 M II IMMI 




Db 


5581 


GT AAATG C AAAG CT ATCT C CT CTG AATG ATG AAG ATGAAG ATGTG AGG CGGG AAAG AC AG 


5640 


Qy 


5641 


AGAATTCTTGATGGTGGAGGCCAGAATGACATCTTAGAAATCAAGGAGTTGACGAAGATA 


5700 




n n i n n ii 1 1 1 1 1 imi n i n n n n 1 1 nnnm ni n ii 1 1 1 n n nn 




Db 


5641 


AGAATTCTTGATGGTGGAGGCCAGAATGACATCTTAGAAATCAAGGAGTTGACGAAGATA 


5700 


Qy 


5701 


TATAGAAGGAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCATTCCTCCTGGTGAG 


5760 




IMM II IMI 1 1 1 M II II IMI 1 II II II 1 1 1 IlilM II 1 II nn M Ml llll II 




Db 


5701 


TATAGAAGGAAGCGGAAGCCTGCTGTTGACAGGATTTGCGTGGGCATTCCTCCTGGTGAG 


5760 


Qy 


5761 


TGCTTTGGGCTCCTGGGAGTTAATGGGGCTGGAAAATCATCAACTTTCAAGATGTTAACA 


5820 




1! IM II II II 1 1 1 M II IMMI 1 II II II 1 Mil II ll 1 1 1 II 1 1 II M MMIII II 




Db 


5761 


TG CTTTGGG CT C CTGGG AGTT AATGGGGCTGG AAAATC ATC AAC TTT C AAG ATGTT AAC A 


5820 


Qy 


5821 


GGAGATACCACTGTTACCAGAGGAGATGCTTTCCTTAACAGAAATAGTATCTTATCAAAC 


5880 






IMI Ml II 1 1 II M 1 II li IM'MI II II 1 MM II Ml M II MM M 1 MM 1 I II 




Db 


5821 


GGAGATACCACTGTTACCAGAGGAGATGCTTTCCTTAACAGAAATAGTATCTTATCAAAC 


5880 


Qy 


5881 


ATC C ATG AAGT AC AT C AG AAC ATGGG CT ACTGC C CT C AGT TTG ATGC CAT C AC AG AG CTG 


5940 




IMI 1 M II i 1 M 1 1 Ml II Mm 1 M II M M iM II M 1 1 1 IMMI M 1 1 Ml 1 1 II 




Db 


5881 


AT CC ATG AAGT AC AT C AG AAC ATGGG CTACTG C C CTC AG T TTG ATGC CAT CAC AG AG CTG 


5940 


Qy 


5941 


TTG ACTGGG AG AG AAC ACGTGG AGTTCTTTG CC C TTTTG AG AGG AGT CC C AG AG AAAG AA 


6000 




illl Ml II M II 1 M M M M M 1 1| || |m i mi || |< 1 1 1 !MI || < 1 1 1 | ;| i 1 1| 




Db 


5941 


TTG ACTGGG AG AG AAC ACGTGG AG TTCTTTG CC C TTTTG AG AGG AGT C CC AG AG AAAG AA 


6000 


Qy 


6001 


G TTGG CAAGGT TGGTG AGTGGG CGATTCGGAAACTGGGCCTCGTG AAGT ATGG AG AAAAA 


6060 




Ml 1 1 1 i II IMI 1 M M 1 1 Mn 1 1| || || M Ml II II 1 M II II II I M 1 IM M II 




Db 


6001 


GTTGGCAAGGT TGGTG AGTGGG CGATTCGGAAACTGGGCCTCGTG AAGT ATGG AG AAAAA 


6060 


Qy 


6061 


TATGCTGGTAACTATAGTGGAGGCAACAAACGCAAGCTCTCTACAGCCATGGCTTTGATC 


6120 




Ml 1 1 IMI IMM M 1 1 1 1 IM MMMI II M Ml II II n 1 II IMI 1 M II IMI II 




Db 


6061 


TATGCTGGTAACTATAGTGGAGGCAACAAACGCAAGCTCTCTACAGCCATGGCTTTGATC 


6120 


Qy 


6121 


GG CGGGC CT CCTGTGGTGTTT CTGG ATG AACCC AC C AC AGG C ATGGAT C CC AAAG C C CGG 


6180 




M M 1 II M II M 1 M III! II MMMI II 1 M IMI II M 1 II IM 1 1 M II II II II 




Db 


6121 


GGCGGGCCTCCTGTGGTGTTTCTGGATGAACCCACCACAGGCATGGATCCCAAAGCCCGG 


6180 


Qy 


6181 


CGGTTCTTGTGGAATTGTGCCCTAAGTGTTGTCAAGGAGGGGAGATCAGTAGTGCTTACA 


6240 




IMM IMI II M 1 1 1 IMMI M 1 II II II M Mill II 1 1 1 III IM M 1 II II 1 Ml 




Db 


6181 


CGGTTCTTGTGGAATTGTGCCCTAAGTGTTGTCAAGGAGGGGAGATCAGTAGTGCTTACA 


6240 


Qy 


6241 


TCT CAT AGT ATGG AAG AATG TG AAG CT CTTTG C ACT AGG ATGGC AAT C ATGGTC AATGG A 


6300 




M II III M II 1 1 1 M II II II 1 Mi IM II 1 M MM II 1 1 1 II II M 1 M II II MM 




Db 


6241 


T CT CAT AGT ATGG AAG AATGTG AAG CTC TTTG C ACT AGG ATGGC AAT C ATGGTC AATGG A 


6300 


Qy 


6301 


AGGTTCAGGTGCCTTGGCAGTGTCCAGCATCTAAAAAATAGGTTTGGAGATGGTTATACA 


6360 




MM 1 II II II 1 1 1 M II II II 1 MMM II 1 M MM II II 1 II II 1! 1 M II II Ml 1 




Db 


6301 


AGGTTCAGGTGCCTTGGCAGTGTCCAGCATCTAAAAAATAGGTTTGGAGATGGTTATACA 


6360 


Qy 


6361 


ATAGTTGTACGAATAGCAGGGTCCAACCCGGACCTGAAGCCTGTCCAGGATTTCTTTGGA 


6420 






MMI II II II II 1 1 1 1 1 IMI 1 MM II II 1 M MM II 1 M II IMI 1 M II 1 1 III 1 




Db 


6361 


ATAGTTGTACGAATAGCAGGGTCCAACCCGGACCTGAAGCCTGTCCAGGATTTCTTTGGA 


6420 


Qy 


6421 


CTTGC ATTT C C TGG AAG TGTT C C AAAAG AG AAAC ACCGG AAC ATGCT AC AAT ACC AG CTT 


6480 




II II 1 II llll II 1 1 II 1 1 1 IM M llll II 1 1 1 II III! II 1 II II II 1 M II 1 1 II 1 1 




Db 


6421 


CTTGC ATTT C CTGG AAGTGTT C C AAAAG AG AAAC AC CGG AAC ATG CT AC AAT AC C AG CTT 


6480 


Qy 


6481 


C C ATCTT C ATT ATCTTC TCTGGC C AGG AT ATT C AG CATCCTCTCC C AG AG C AAAAAG CG A 


6540 






ii iM ii n ii ii mi i imi i m ii m niini iiii n i n iiii m i iiii ii i 




Db 


6481 


CCATCTTCATTATCTTCTCTGGCCAGGATATTCAGCATCCTCTCCCAGAGCAAAAAGCGA 


6540 


Qy 


6541 


CTC C AC AT AG AAG ACT ACTCTGTTT CTC AG AC AAC ACTTG ACC AAGT ATTTG TG AACTTT 


6600 






1 1 1 1 1 II 1 1 II 1 1 11 1 1 1 II 1 1 1 M 11 II II 1 1 1 1 E III 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


6541 


CT CC AC AT AG AAG ACT ACTCTG T TT CT C AG AC AAC ACTTG AC C AAGT ATTTGTG AACTTT 


6600 



Qy 6601 GCC AAGG AC C AAAGTG ATGATG AC CACTT AAAAG AC CTCTC ATT AC AC AAAAAC C AG AC A 6660 

II II II Ml Nil 1 1 1 II II II I! I II Mil 1 1 1 II II II 1 1 1 M II II II I lllllll I 

Db 6601 GC C AAGG AC C AAAGTG ATGATG AC CACTT AAAAG AC CTCT C ATT AC AC AAAAAC C AG AC A 6660 

Qy 6661 GT AGTGG ACGTTG C AGTTCTC AC ATCTTTT CT AC AGGATG AG AAAG TG AAAG AAAG CT AT 6720 

MM I M MM II M I llllll IMIMI II I M MM III M MM II II I II MM 1 1 

Db 6661 GT AGTGG ACGTTG C AG TTCTC AC AT CTTTTCT AC AGGATG AG AAAGTG AAAG AAAGCT AT 6720 

Qy 6721 GTATGAAGAATCCTGTTCATACGGGGTGGCTGAAAGTAAAGAGGGACTAGACTTTCCTTT 6780 

II II I M li II II i MIIIMI M Ml II II I M M II M I Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

Db 6721 GTATGAAGAATCCTGTTCATACGGGGTGGCTGAAAGTAAAGAGGNACTAGACTTTCCTTT 6780 

Qy 6781 GCACCATGTGAAGTGTTGTGGAGAAAAGAGCCAGAAGTTGATGTGGGAAGAAGTAAACTG 6840 

M Ihii MM II I M IMM I I I I II M II! !MMI M I I I I I II II I M II II II I 

Db 6781 GCACCATGTGAAGTGTTGTGGAGAAAAGAGCCAGAAGTTGATGTGGGAAGAAGTAAACTG 684 0 

Qy 6841 G AT ACTGT ACTG AT ACT ATT C AATG C AATGC AATT C AATG 6880 

M MMMM M I M 1 1 MM ! M ! 1 1 ! M I M M Ml I : 

Db 6841 GAT ACTGT ACTG AT ACT ATT C AATG C AATG C AATT C AATG 6 880 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


6880 


100 


.0 


6880 


21 


AAZ94734 


Human ATP binding 


2 


6880 


100 


.0 


6880 


22 


AAI70314 


Human ATP binding 


3 


6880 


100 


.0 


7260 


22 


AAD21326 


Human ATP binding 


4 


6880 


100 


.0 


7260 


22 


AAI70315 


Human ATP binding 


5 


6879 


100 


.0 


9497 


24 


ABV78020 


Hypoxia- regulated 


6 


6876 


.8 


100 


.0 


6880 


22 


AAD21325 


Human ATP binding 


7 


6875 


.8 


99 


.9 


9741 


22 


AAS06120 


Human ABC1 DNA seq 


8 


6875 


.8 


99 


.9 


9741 


24 


AAD37273 


Human ABC1 full-le 


9 


6875 


.8 


99, 


.9 


9854 


22 


AAS06121 


Human ABC1 DNA seq 


10 


6875 


.8 


99. 


.9 


9870 


24 


ABN99301 


Polymorphic 


human 


11 


6875 


.8 


99 


.9 


9870 


24 


ABN99302 


Polymorphic 


human 


12 


6875 


.8 


99, 


.9 


9870 


24 


ABN99303 


Polymorphic 


human 


13 


6875 


.8 


99, 


.9 


9870 


24 


ABN99324 


Polymorphic 


human 


14 


6875 


.8 


99, 


.9 


9870 


24 


ABN99328 


Polymorphic 


human 


15 


6875 


.8 


99, 


.9 


9870 


24 


ABN99329 


Polymorphic 


human 


16 


6875 


. 8 


99, 


. 9 


9870 


24 


ABN99330 


Polymorphic 


human 


17 


6875 


.8 


99, 


.9 


9870 


24 


ABN99331 


Polymorphic 


human 


18 


6875 


.8 


99, 


.9 


9870 


24 


ABN99332 


Polymorphic 


human 


19 


6875 


.8 


99, 


. 9 


9870 


24 


ABN99333 


Polymorphic 


human 


20 


6875 


.8 


99, 


.9 


9870 


24 


ABN99334 


Polymorphic 


human 


21 


6875 


.8 


99, 


.9 


9870 


24 


ABN99335 


Polymorphic 


human 


22 


6875 


.2 


99, 


.9 


7281 


22 


AAK51683 


Human polynucleoti 


23 


6874 


.2 


99, 


.9 


9741 


24 


ABL58146 


Human ABCA1 


transp 


24 


6874 


.2 


99, 


.9 


9870 


24 


ABN99304 


Polymorphic 


human 


25 


6874 


.2 


99, 


.9 


9870 


24 


ABN99305 


Polymorphic 


human 


26 


6874 


.2 


99, 


.9 


9870 


24 


ABN99306 


Polymorphic 


human 


27 


6874 


.2 


99, 


.9 


9870 


24 


ABN99307 


Polymorphic 


human 


28 


6874 


. 2 


99, 


.9 


9870 


24 


ABN99309 


Polymorphic 


human 


29 


6874 


. 2 


99, 


.9 


9870 


24 


ABN99310 


Polymorphic 


human 


30 


6874 


.2 


99, 


.9 


9870 


24 


ABN99311 


Polymorphic 


human 


31 


6874 


.2 


99. 


.9 


9870 


24 


ABN99312 


Polymorphic 


human 


32 


6874 


.2 


99. 


.9 


9870 


24 


ABN99313 


Polymorphic 


human 


33 


6874 , 


.2 


99. 


.9 


9870 


24 


ABN99314 


Polymorphic 


human 


34 


6874 


.2 


99. 


.9 


9870 


24 


ABN99315 


Polymorphic 


human 


35 


6874 , 


.2 


99. 


9 


9870 


24 


ABN99316 


Polymorphic 


human 


36 


6874 , 


.2 


99. 


.9 


9870 


24 


ABN99317 


Polymorphic 


human 


37 


6874 , 


.2 


99. 


.9 


9870 


24 


ABN99318 


Polymorphic 


human 


38 


6874 


.2 


99. 


9 


9870 


24 


ABN99319 


Polymorphic 


human 


39 


6874 


. 2 


99. 


,9 


9870 


24 


ABN99320 


Polymorphic 


human 


40 


6874. 


.2 


99. 


.9 


9870 


24 


ABN99321 


Polymorphic 


human 


41 


6874 , 


.2 


99. 


,9 


9870 


24 


ABN99322 


Polymorphic 


human 


42 


6874 . 


.2 


99. 


9 


9870 


24 


ABN99323 


Polymorphic 


human 


43 


6874 . 


.2 


99. 


9 


9870 


24 


ABN99325 


Polymorphic 


human 


44 


6874 . 


.2 


99. 


9 


9870 


24 


ABN99326 


Polymorphic 


human 



45 6874.2 



99.9 



9870 24 ABN99327 



Polymorphic human 



SUMMARIES 



Result 



% 

Query 





No. 


Score 


Match 


Length DB 


ID 


Description 




1 


430.8 


6, 


.3 


5894 


3 


US-08-665-259-24 


Sequence 


24, Appl 




2 


430.8 


6, 


.3 


5894 


3 


US-08-762-500-24 


Sequence 


24, Appl 




3 


430.8 


6, 


.3 


6525 


3 


US-08-762-500-74 


Sequence 


74, Appl 




4 


99.4 


1, 


,4 


1008 


4 


US-09-252-991A-4266 


Sequence 


4266, Ap 


c 


5 


98.8 


1. 


.4 


999 


4 


US-09-252-991A-4588 


Sequence 


4588, Ap 




6 


89.4 


1. 


,3 


1614 


4 


US-09-252-991A-4198 


Sequence 


4198, Ap 




7 


85.8 


1 . 


.2 


1824 


4 


US-09-252-991A-1780 


Sequence 


1780, Ap 




8 


83 .8 


1 , 


. 2 


1176 


4 


US-09-252-991A-3704 


Sequence 


3704, Ap 


c 


9 


83.8 


1, 


.2 


2253 


4 


US-09-252-991A-3810 


Sequence 


3810, Ap 




10 


81 


1 , 


.2 


36181 


4 


US-08-311-731A-120 


Sequence 


120, App 




11 


78 .4 


1. 


. 1 


1548 


4 


US-0 9-252-991A-36 53 


Sequence 


3653, Ap 


c 


12 


77 .4 


1, 


,1 


762 


4 


US-09-252-991A-1938 


Sequence 


1938, Ap 


c 


13 


76.6 


1 . 


.1 


4411529 


3 


US-09-103-840A-1 


Sequence 1, Appli 




14 


75.2 


1, 


,1 


969 


4 


US-09-107-532A-1008 


Sequence 


1008, Ap 




15 


75 


1. 


.1 


1770 


4 


US-09-328-352-3466 


Sequence 


3466, Ap 



SUMMARIES 



Result 



% 

Query 





No. 


Score 


Match Length 


DB 


ID 


Description 




1 


3600.4 


52 


.3 


4783 


11 


AK051920 


AK051920 


Mus muscu 




2 


1024 .8 


14 , 


.9 


3605 


11 


AK052916 


AK052916 


Mus muscu 




3 


692 .4 


10 


.1 


809 


12 


BI754756 


BI754756 


603025477 




4 


636 .6 


9 


.3 


854 


12 


BI854140 


BI854140 


603381449 




5 


624 .4 


9 


.1 


754 


14 


CD351847 


CD351847 


UI-M-GI0- 


c 


6 


591.2 


8, 


.6 


878 


12 


BI182779 


BI182779 


UNL-P-FN- 




7 


587.8 


8, 


.5 


673 


14 


CD351490 


CD351490 


UI-M-GI0- 


c 


8 


550.4 


8, 


.0 


661 


14 


CB481810 


CB481810 


jns83_A02 




9 


547.6 


8 . 


.0 


773 


14 


CB526974 


CB526974 


UI-M-FY0- 




10 


542 


7, 


.9 


837 


10 


BF160011 


BF160011 


601768192 




11 


536.8 


7 , 


. 8 


652 


10 


BB468374 


BB468374 


BB468374 




12 


535.2 


7 . 


.8 


702 


12 


BI658600 


BI658600 


603284335 




13 


532.4 


7 . 


. 7 


724 


12 


BG920223 


BG920223 


602822304 


c 


14 


528 .4 


7 , 


, 7 


639 


14 


CB477340 


CB477340 


jns20_Cll 




15 


525 


7 , 


,6 


720 


12 


BI248317 


BI248317 


602966918 




16 


516 


7. 


.5 


620 


14 


CA889361 


CA889361 


B0152D10- 




17 


510.4 


7, 


,4 


512 


13 


BX279856 


BX279856 


BX279856 


c 


18 


505 


7, 


,3 


614 


14 


CB478239 


CB478239 


jns32_B02 




19 


486 .4 


7 . 


.1 


605 


14 


CB546553 


CB546553 


AMGNNUC : N 




20 


479.8 


7 , 


, 0 


579 


14 


CA884367 


CA884367 


B0109G01- 




21 


477 


6. 


.9 


1004 


11 


BC034824 


BC034824 


Homo sapi 




22 


473 


6. 


.9 


562 


10 


BE665489 


BE665489 


154542 MA 




23 


459.4 


6. 


, 7 


515 


10 


BF094524 


BF094524 


MR0-UT004 




24 


456 .8 


6. 


6 


605 


14 


CB581583 


CB581583 


AMGNNUC : N 




25 


447.6 


6. 


5 


554 


10 


BF076322 


BF076322 


225856 MA 




26 


438.4 


6. 


.4 


512 


14 


CB717058 


CB717058 


AMGNNUC : N 




27 


435.6 


6 . 


3 


884 


13 


BU906084 


BU906084 


AGENCOURT 



RESULT 1 
AK051920 
LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 



AK051920 4783 bp mRNA linear HTC 05-DEC-2002 

Mus musculus 12 days embryo eyeball cDNA, RIKEN full-length 
enriched library, clone :D230019D04 product : ATP-binding cassette, 
sub-family A (ABC1) , member 1, full insert sequence. 
AK051920 

AK051920.1 GI: 26342297 

HTC; CAP trapper. 

Mus musculus (house mouse) 



ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 

REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 

Carninci,P. and Hayashizaki, Y . 

High-efficiency full-length cDNA cloning 

Meth. Enzymol. 303, 19-44 (1999) 

99279253 

10349636 

2 

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno ( H. ( Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 

Normalization and subtraction of cap- trapper- selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new gene 

Genome Res. 10 (10), 1617-1630 (2000) 

20499374 

11042159 

3 

Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., Carninci,P. 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura, S . , Hazama,M., Nishine,T., Harada,A., 
Yamamoto,R., Matsumoto, H . , Sakaguchi , S . , Ikegami,T., Kashiwagi, K. , 
Fujiwake,S. , Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J. 
Okazaki,Y., Muramatsu, M. , Inoue,Y., Kira,A. and Hayashizaki , Y , 
RIKEN integrated sequence analysis (RISA) system- -384 -format 
sequencing pipeline with 384 multicapillary sequencer 
Genome Res. 10 (11), 1757-1771 (2000) 
20530913 
11076861 
4 

Kawai,J., Shinagawa, A. , Shibata,K., Yoshino,M., Itoh,M., Ishii,Y., 
Arakawa,T., Hara,A., Fukunishi , Y . , Konno,H., Adachi,J., Fukuda,S., 
Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I., 
Saito,T., Okazaki,Y., Goj obori , T . , Bono,H., Kasukawa,T., Saito,R., 
Kadota,K., Matsuda,H., Ashburner , M . , Batalov,S., Casavant,T., 
Fleischmann, W . , Gaasterland, T . , Gissi,C, King,B., Kochiwa,H., 
Kuehl,P., Lewis, S., Matsuo,Y., Nikaido,I., Pesole,G., 
Quackenbush, J . , Schriml , L .M . , Staubli,F., Suzuki ,R., Tomita,M., 
Wagner, L . , Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H., 
Baldarelli, R . , Barsh,G., Blake, J. , Boffelli,D., Bojunga,N., 
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FEATURES Location/Qualifiers 
source 1 . .4783 

/organism= "Mus musculus" 

/mol_t ype = 11 mRNA " 

/strain= u C57BL/6J" 

/db_xre f = " FANT0M_DB :D230019D04" 

/db_xref="taxon: 10090" 

/clone="D230019D04" 

/tissue_type=" eyeball" 

/clone_lib= "RIKEN full-length enriched mouse cDNA library" 
/dev_stage=" 12 days embryo" 
CDS 322. .>4782 

/not e=" unnamed protein product; ATP-binding cassette, 

sub- family A (ABC1), member 1 (MGD | MGI : 99607 , 

GB|NM_013454 , evidence: BLASTN, 99%, match=4545) 

putative " 

/codon_start=l 

/protein_id= 11 BAC34 81 1 . 1 " 

/db_xref="GI: 26342298" 

/translation " MACWPQLRLLLWKNLTFRRRQTCQLLLEVAWPLFIFLILISVRL 
SYPPYEQHECHFPNKAMPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIV 
SRLFSDAQRLLLYSQRDTSIKDMHKVLRMLRQIKHPNSNLKLQDFLVDNETFSGFLQH 
NLSLPRSTVDSLLQANVGLQKVFLQGYQLHLASLCNGSKLEEIIQLGDAEVSALCGLP 
RKKLDAAERVLRYNMDILKPWTKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTK 
SWSDMRQEVMFLTNVNSSSSSTQIYQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALF 
GGNNTEEDVDTFYDNSTTPYCNDLMKNLESSPLSRIIWKALKPLLVGKILYTPDTPAT 
RQVMAEVNKTFQELAVFHDLEGMWEELSPQIWTFMENSQEMDLVRTLLDSRGNDQFWE 
QKLDGLDWTAQDIMAFLAKNPEDAQSPNGSVYTWREAFNETNQAIQTISRFMECVNLN 
KLEPXPTEVRLINKSMELLDERKFWAGIVFTGITPDSVELPHHVKYKIRMDIDNVERT 
NKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDWEQAIIRVLTGSEKKTGVYVQQMPY 
PCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKSIVYEKEARLKETMRIMGLDNGILW 
FSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFVFLSVFAMVTILQCFLISTLF 
SRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFSIKIFASLLSPVAFGFGCEYFALFE 
EQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTWYIEAVFPGQYGIPRPW 
YFPCTKSYWFGEEIDEKSHPGSSQKGVSEICMEEEPTHLRLGVSIQNLVKVYRDGMKV 
AVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYILGKDIRSEMSSIR 
QNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVGLPPSKLKSKT 
SQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQGRTIILST 
HHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLSSCRNSS 
STVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDIGHEL 
TYVLPYEAAKEGAFVELFHEIDDRLSDLGI SSYGI SETTLEE I FLKVAEESGVDAETS 
DGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 
G WKLTQQQ F V ALLW KRLL I ARR S RKG F FAQ I VL P A VF VC I AL V FSLIVPPFGKYPSLE 
LQPWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPDTPCLAGEEDW 
TISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPPP" 

BASE COUNT 1119 a 1236 c 1310 g 1118 t 

ORIGIN 

Query Match 52.3%; Score 3600.4; DB 11; Length 4 783; 

Best Local Similarity 88.6%; Pred. No. 0; 

Matches 3 901; Conservative 0; Mismatches 501; Indels 0; Gaps 0; 
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Mill Mill III! I II I Mill INI llllllll I II I MM I llll I III MM III 

CAAACATGTCAGCTGTTACTGGAAGTGGCCTGGCCTCTCTTTATCTTCCTGATCCTGATC 441 
TCTGTTCGGCTGAGCTACCCACCCTATGAACAACATGAATGCCATTTTCCAAATAAAGCC 120 

lllll II III! I II I MM I Ml lllllllllll Mill lllll lllllllll 

T CTGT ACG CCTG AG CT ACC C AC CCT ACGAAC AAC ATG AGTGCC ACTTT CCG AAT AAAG CC 501 
ATGCCCTCTGCAGGAACACTTCCTTGGGTTCAGGGGATTATCTGTAATGCCAACAACCCC 180 

lllll lllllllllll II II lllll MM I IIIMIMI lllll IIMM llll 

ATG CCGT CTGC AGG AACCCTCC C CTGGGT AC AGGGG ATT ATCTGT AATGC C AAC AACCCC 561 
TGTTTCCGTTACCCGACTCCTGGGGAGGCTCCCGGAGTTGTTGGAAACTTTAACAAATCC 24 0 

II llllllll II lllll II lllllllllll MIMMMIIIIIIIMII III 

TGCTTCCGTTATCCAACTCCCGGCGAGGCTCCCGGTGTTGTTGGAAACTTTAACAAGTCC 621 
ATTGTGGCTCGCCTGTTCTCAGATGCTCGGAGGCTTCTTTTATACAGCCAGAAAGACACC 300 

II Ml llllllllllllllll llll MM I llllllll I III III 

ATCGTGTCTCGCCTGTTCTCAGACGCTCAGAGGCTTCTTCTGTACAGCCAAAGAGATACC 681 
AGC ATG AAGG AC ATGCG C AAAG TT CTG AG AAC ATT AC AGC AG ATC AAG AAAT CC AG CT C A 360 

lllll IIIMIMI III II 1 1 1 M 1 1 llll lllll I III llll 

AGCATTAAGGACATGCACAAGGTCCTGAGAATGTTACGGCAGATCAAGCATCCCAACTCA 741 
AACTTGAAGCTTCAAGATTTCCTGGTGGACAATGAAACCTTCTCTGGGTTCCTGTATCAC 420 

ii i m 1 1 1 1 1 ii iiiii iiiiiiiiiiiiiiii iiiiiiii mill i 111 

AATTTGAAGCTCCAGGATTTTCTGGTGGACAATGAAACATTCTCTGGATTCCTGCAGCAC 801 
AACCTCTCTCTCCCAAAGTCTACTGTGGACAAGATGCTGAGGGCTGATGTCATTCTCCAC 4 80 

ii i ii 1 1 iiii iiiiiiiiiii iiiii 111 iiii mill 

AATTTGTCCCTTCCAAGATCTACTGTGGACAGCCTGCTGCAGGCGAATGTTGGTCTCCAG 861 
AAGGT ATTT TTGC AAGGC T AC C AG TT AC ATTTG AC AAGT CTGTG C AATGG AT C AAAAT C A 54 0 

IIMIIIIillllllll MM lllllllll I llllllll II llllllll I 

AAGGT ATTT T TGC AAGGC T AC C AATT AC ATTTGG CC AGT CTG TGT AACGG AT C AAAATT A 921 
GAAGAGATGATTCAACTTGGTGACCAAGAAGTTTCTGAGCTTTGTGGCCTACCAAGGGAG 600 

lllll II lllll llllllll IMMMI II lllll lllll III II 

GAAGAAATTATTCAGCTTGGTGATGCGGAAGTTTCTGCCCTCTGTGGTCTACCGAGGAAG 981 
AAACTGGCTGCAGCAGAGCGAGTACTTCGTTCCAACATGGACATCCTGAAGCCAATCCTG 660 

ii ii i linn mi iiimi n i iiiiiiM iniMiiiiiM i ii 

AAG CTCG ATG C AG CCG AG AG AGT ACTG CG CT AC AAC ATGG AC AT C CTG AAG CC AG TTGTG 1041 
AG AAC ACT AAACT CT AC AT CTCC CTT C CCG AG C AAGG AG C TGG C CG AAG C C AC AAAAAC A 720 

I II IIIMI II IMIIII IMIII I 1 1 I lllll IMMMI I I 

ACAAAACTAAATTCCACATCTCATCTCCCGACCCAGCATCTGGCTGAAGCCACCACAGTG 1101 
TTGCTGCATAGTCTTGGGACTCTGGCCCAGGAGCTGTTCAGCATGAGAAGCTGGAGTGAC 780 

Mill I II I III IMMMI i r 1 1 f 1 1 1 1 1 1 1 1 I IMMMI Ml 

TTGCTTGACAGCTTGGGGGGCCTGGCCCAAGAGCTGTTCAGCACAAAGAGCTGGAGCGAC 1161 
ATGCGACAGGAGGTGATGTTTCTGACCAATGTGAACAGCTCCAGCTCCTCCACCCAAATC 84 0 

Mill 1 1 1 ii ii 1 1 m ! 1 1 1 1 1 1 ii !! i : 1 1 : i i 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 ; 1 1 1 mi 

ATG CGG C AGG AGG TG ATGTTTCTG AC C AACG TG AAC AG CTCC AG CTCCT C C ACCC AG ATC 1221 
TACCAGGCTGTGTCTCGTATTGTCTGCGGGCATCCCGAGGGAGGGGGGCTGAAGATCAAG 900 

IMMMI lllll II II II II II II II lllll Mill IMMMMMI 

T ACC AGG C AGTG TCCCG C AT CGTGTGTGGTC ACCC AG AGGGTGGGGG C CTG AAG AT C AAG 1281 

TCTCTCAACTGGTATGAGGACAACAACTACAAAGCCCTCTTTGGAGGCAATGGCACTGAG 960 

II lllllllllll lllll I I I I' M II II I I I II II l!M II I Ml III III 
TCCCTCAACTGGTACGAGGATAACAACTACAAAGCCCTCTTTGGAGGGAATAACACCGAG 1341 

G AAG ATGC TG AAACCTT CT ATG AC AACTCT AC AACTC CTT ACTGC AATG ATTTG ATG AAG 1020 

lllll I II llllllll M 1 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 III II II !l II I 

GAAGACGTGGACAC CTT CTATGACAATTCTACAACTC CTT ATTGC AATG ATTTG ATG AAG 14 01 
AATTTGGAGTCTAGTCCTCTTTCCCGCATTATCTGGAAAGCTCTGAAGCCGCTGCTCGTT 1080 

II Ml MM IIIMI II II II II lllll lllll II II lllll lllll III 

AACTTGGAGTCTAGTCCTCTTTCTCGAATTATTTGGAAGGCACTCAAGCCACTGCTTGTT 14 61 



Qy 1081 GGGAAGATCCTGTATACACCTGACACTCCAGCCACAAGGCAGGTCATGGCTGAGGTGAAC 114 0 

II IIIII II llllllllllllll IIIII lllllllllllllllllllllllllll 

Db 1462 GGAAAGATTCTCTATACACCTGACACACCAGCTACAAGGCAGGTCATGGCTGAGGTGAAC 1521 

Qy 1141 AAG AC CTTCC AGG AACTGGCTGTGTTCC ATG ATCTGG AAGG C ATG TGGG AGG AACT C AG C 1200 

Illlllll Mill 1 1 1 1 1 1 1 1 1 1 IIIII lllllllllll lllllllll 

Db 1522 AAG AC CTTTC AGG AGTTGGCTG TGTTCC ATG AC CTGG AGGG C ATG TGGG AAG AACT C AG C 1581 

Qy 1201 CC C AAG ATCTGG AC CTT C ATGG AG AAC AG C C AAG AAATGG ACCTTGT C CGG ATGCTGTTG 1260 

III I II I II 1 1 1 1 IN 1 1 1 II II 1 1 1 1 M 1 1 1 1 1 1 1 1 f 1 1 f 1 1 1 1 1 1 MINI 

Db 1582 CCCCAAATTTGG ACC TTC ATGG AG AAC AGC C AAG AG ATGG ACCTTGT CCGGACGCTG TT A 1641 

Qy 1261 GACAGCAGGGACAATGACCACTTTTGGGAACAGCAGTTGGATGGCTTAGATTGGACAGCC 1320 

Illlllll I lllllllll llllllllllll llllllllll Mlllllllll III 
Db 1642 GACAGCAGAGGCAATGACCAGTTTTGGGAACAGAAGTTGGATGGATTAGATTGGACTGCC 1701 

Qy 1321 CAAGACATCGTGGCGTTTTTGGCCAAGCACCCAGAGGATGTCCAGTCCAGTAATGGTTCT 1380 

lllllllll Illlllll Illlllll lllllll Mil MINI IIIII III 

Db 1702 C AAG AC ATC ATGG CGTTTCTGGCC AAG AACCC AG AAG ATGCTCAGTCCCC AAATGG CTCT 1761 

Qy 1381 GTG T AC ACC TGG AG AG AAG CTTTC AACG AG ACT AAC C AGG C AAT CCGG AC C AT AT CT CG C 144 0 

IIIII llllllllllllllllllll Mill IMMMIIMM Ml IIIMIII 

Db 1762 GTGTAT ACC TGG AG AG AAG CTTT C AATG AG ACC AACC AGGC AAT C C AG ACG AT AT CT CG A 1821 

Qy 1441 TTC ATGG AGTG TGT C AAC CTG AAC AAG CT AG AAC CC AT AG C AAC AG AAG T CTGG CT CATC 1500 

MIMMMIIMMMMMMIIMM Illlllll I IIMIIMI M M II II 

Db 1822 TTC ATGG AGTG TGT CAACCTG AAC AAG CTGG AAC C CATT CCG AC AG AAG T C AGG CT C AT C 1881 

Qy 1501 AACAAGTCCATGGAGCTGCTGGATGAGAGGAAGTTCTGGGCTGGTATTGTGTTCACTGGA 156 0 

I MM h II I! II M II MM I! MMIMMMMMIMM II M 1 1 II M II 

Db 1882 AACAAGTCCATGGAGCTGCTGGACGAGAGGAAGTTCTGGGCTGGCATCGTGTTCACAGGC 1941 

Qy 1561 ATTACTCCAGGCAGCATTGAGCTGCCCCATCATGTCAAGTACAAGATCCGAATGGACATT 162 0 

II IIIIIM II I MMMMMMIMM MIMIMIIMM MINIMI 

Db 1942 ATC ACTCC AG AT AGTGTGG AGCTG C CC C ATC ATGT AAAGT AC AAG ATC CGG ATGG AC ATT 2 001 

Qy 1621 G AC AATGTGG AG AGGACAAAT AAAAT C AAGG ATGGGT ACTGGG AC C C TGGT C CTCG AG C T 1680 

IMM Illlllll II Mill I M i M 1 1 1 1 1 1 1 1 1 1 MM II Ml M II II III 

Db 2002 G AC AACGTGG AG AG AACT AAT AAG AT C AAGG ATGGGT ACTGGG ACC CTGGT C CTCGGGCT 2061 

Qy 1681 GACCCCTTTGAGGACATGCGGTACGTCTGGGGGGGCTTCGCCTACTTGCAGGATGTGGTG 174 0 

IIIII Mill II IIIII II IIIIIMI M I MM M M II II II M II II II II 

Db 2062 GACCCTTTTGAAGATATGCGCTATGTCTGGGGCGGCTTCGCCTACTTGCAGGATGTGGTG 2121 

Qy 1741 GAG C AGG C AATC ATC AGGGTGCTG ACGGG C ACCG AG AAG AAAACTGGTGTCT AT ATG C AA 1800 

II Mill Illlllll lllllllllll I IMMIIMM IIIIIMI IIIII 

Db 2122 G AAC AGG C C ATC ATC AG AGTGCTG ACGGG AT CTG AG AAG AAAACGGGTGT CT ACG TG CAA 2181 

Qy 1801 CAG ATGC C CT ATC C CTGTT ACGTTG ATG AC AT CTTT CTG CGGGTG ATG AG C CGGT C AATG 1860 

IMIIIIIMI 1 1 i 1 1 1 1 1 IMMMIMI IMMIIMM IMIIMIIMMII 

Db 2182 CAG ATGC CCT ACC C CTGTT ATG TTG ATG ACATTTTT CTGCGGGT C ATG AG C CGGT C AATG 2241 

Qy 18 61 CCCCTCTTCATGACGCTGGCCTGGATTTACTCAGTGGCTGTGATCATCAAGGGCATCGTG 1920 

M 1 1 1 1 II ii Ml I II Illlllll IIIII II IIIIMMMMIII MM Ml 

Db 224 2 C CCCT CTT C ATG ACTCT AG C CTGG AT CT ACT CTGT CG CTGTG ATC ATC AAG AG C ATTG TG 2301 

Qy 1921 T ATG AG AAGG AGG C ACGG C TG AAAG AG ACC ATGCGG ATC ATGGG C CTGG AC AAC AG CATC 1980 

IMMMIMMII Illlllll I II I M M 1 1 M 1 II 1 1 1 1 1 Illlllll Mill 

Db 2302 T ATG AG AAGG AGG C T CGGCTG AAGG AG ACC ATG CGG ATC ATGGGT CTGG AC AATGG CAT C 2361 

Qy 1981 CTCTGGTTTAGCTGGTTCATTAGTAGCCTCATTCCTCTTCTTGTGAGCGCTGGCCTGCTA 204 0 

MMMMMMIMM MM IIIIIMI Mill 1 1 1 1 M M I M M 1 1 1 1 1 M 

Db 2362 CTCTGGTTTAGCTGGTTTGTTAGCAGCCTCATCCCTCTGCTTGTGAGCGCTGGCCTGCTG 2421 

Qy 2041 GTGGTCATCCTGAAGTTAGGAAACCTGCTGCCCTACAGTGATCCCAGCGTGGTGTTTGTC 2100 

lllllllll 1 1 1 1 1 M II 1 1 1 II ! I M M M 1 1 1 Mill MIMMIMIIM III 

Db 24 22 GTGGTCATCTTGAAGTTAGGAAACCTGCTGCCCTATAGTGACCCCAGCGTGGTGTTCGTC 2481 

Qy 2101 TTCCTGTCCGTGTTTGCTGTGGTGACAATCCTGCAGTGCTTCCTGATTAGCACACTCTTC 2160 

IIIIIMI IIIMIII IIIIIM IIIII IMMIIMM Illlllll llllll 

Db 24 82 TTCCTGTCTGTGTTTGCCATGGTGACCATCCTACAGTGCTTCCTCATTAGCACGCTCTTC 2541 
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TCCAGAGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 22 20 

III I III I Mil MIIIIHII Mill IIIIMIII I IIII I III 1 1 Mil Mill II 

TCCCGTGCCAACCTGGCAGCAGCCTGTGGGGGCATCATCTACTTCACGCTGTACCTGCCC 2 601 
TACGTCCTGTGTGTGGCATGGCAGGACTACGTGGGCTTCACACTCAAGATCTTCGCTAGC 2280 

II II Mill II II lllllllllll lllllllll I llllllllll llllll 

TATGTGCTGTGCGTAGCCTGGCAGGACTATGTGGGCTTCTCCATCAAGATCTTTGCTAGC 2661 
CTGCTGTCTCCTGTGGCTTTTGGGTTTGGCTGTGAGTACTTTGCCCTTTTTGAGGAGCAG 2340 

lllllllllll lllllll ll lllllllllll II llllllll IMIIIII 

CTGCTGTCTCCTGTGGCTTTTGGATTCGGCTGTGAGTATTTCGCCCTTTTCGAGGAGCAA 2721 
GGCATTGGAGTGCAGTGGGACAACCTGTTTGAGAGTCCTGTGGAGGAAGATGGCTTCAAT 24 00 

II II II II II llllllll II llllllll II MIMIII II IMIIIMI 

GGTATCGGGGTCCAATGGGACAATCTCTTTGAGAGCCCGGTGGAGGAGGACGGCTTCAAT 2 781 

CTCACCACTTCGGTCTCCATGATGCTGTTTGACACCTTCCTCTATGGGGTGATGACCTGG 2460 

lllllllll I II lllllllllll lllllllllll llllllll llllllll III 
CTC AC C ACTG C AG TGTC C ATGATG CT C TT TG AC ACCTTT CTCT ATGG CGTG ATG AC ATGG 2 841 

TACATTGAGGCTGTCTTTCCAGGCCAGTACGGAATTCCCAGGCCCTGGTATTTTCCTTGC 2520 

Mill II II MIMIIMM Mill llllllllllll:! IIMI IIII IIIII 

TACATCGAAGCCGTCTTTCCAGGACAGTATGGAATTCCCAGGCCCTGGTATTTTCCTTGT 2901 
ACCAAGTCCTACTGGTTTGGCGAGGAAAGTGATGAGAAGAGCCACCCTGGTTCCAACCAG 2580 

llllllll lllllllllll lllllll IMMIIIIIIIIIMIIIIIimi IIII 

ACCAAGTCATACTGGTTTGGTGAGGAAATTGATGAGAAGAGCCACCCTGGTTCCAGCCAG 2961 

AAG AG AAT AT CAG AAATCTG C ATGG AGGAGG AACC C AC C CACTTG AAG CTGGGCGTGTCC 264 0 

III II I IIIIIIIIIIIIIIIII lllllllllll II III llllll llllll 
AAGGGAGTGTCAGAAATCTGCATGGAAGAGGAACCCACTCATCTGAGGCTGGGGGTGTCC 3021 

ATT C AG AAC CTGGT AAAAG T CTAC CG AG ATGGG ATG AAGGTGG CTG T CG ATGGC CTGGC A 2 700 

iiiiiiiiiiiiii ii ii iiiiiiiiiii iiiiiiii iiiii Mini mi 

ATT C AG AAC CTGGTG AAGG TTT AC CG AG ATGG C ATG AAGGTTG CTGTGG ATGG CTTGG CG 3081 
CTG AATTTTT ATG AGGG CC AG ATC AC CT C CTT C C TGGG C CAC AATGG AG CGGGG AAG ACG 2 760 

n ii iiiii ii iiiiiiii iiiimmiiiiimiiimii iiiiiiii 

CTC AACTTTT ACG AAGG C CAG ATT ACCTC CTT CCTGGGC CAC AATGG AG C AGGG AAG AC C 3141 

ACCACCATGTCAATCCTGACCGGGTTGTTCCCCCCGACCTCGGGCACCGCCTACATCCTG 2820 

IIIIIIIIIIIIII IIIII III IIII IIIII II II IIIII llllllllllll 
ACCACCATGTCAATACTGACTGGGCTGTTTCCCCCAACTTCTGGCACGGCCTACATCCTG 32 01 

GGAAAAGACATTCGCTCTGAGATGAGCACCATCCGGCAGAACCTGGGGGTCTGTCCCCAG 2880 

II II lllllllllll lllllllll lllllllllllllllllll llllllllllll 
GGGAAGGACATTCGCTCGGAGATGAGCTCCATCCGGCAGAACCTGGGAGTCTGTCCCCAG 3261 

CATAACGTGCTGTTTGACATGCTGACTGTCGAAGAACACATCTGGTTCTATGCCCGCTTG 2940 

IIIII lllllllllllllllllllllllllllll IIIIIIIIIIIIIIIII III I 
CATAATGTGCTGTTTGACATGCTGACTGTCGAAGAGCACATCTGGTTCTATGCGCGCCTA 3321 

AAAGGG CTCT CTG AG AAGCACGTG AAGG CGGAGATGG AG CAG ATGG CCCTGGATGTTGGT 3000 

II llllllll IIIIIIIIIIIIII II lllllllllllllllllllllllllllll 
AAGGGG C TCT C AG AG AAG C ACGTG AAAGC AG AG ATGG AG CAG ATGG C C CTGGATG TTGGC 3381 

TTGCCATCAAGCAAGCTGAAAAGCAAAACAAGCCAGCTGTCAGGTGGAATGCAGAGAAAG 3060 

II II I Ml II MM 1 1 1 1 II II I II II IIIII llllllll llllllllllll 

TT AC CC CCG AGC AAG CTG AAAAGC AAAACG AGTC AG CT CTC AGGTGGG ATG CAG AG AAAG 3441 

CT AT CTGTGG CCTTGG C CTT TGTCGGGGG ATCT AAGGTTGTC ATTC TGG ATGAAC CC AC A 3120 

II IIIIIIIIIIIIIIIII II II IIIII IIIIIIIIIIIIIIIII II llllll 
CTGTCTGTGGCCTTGGCCTTCGTGGGTGGATCCAAGGTTGTCATTCTGGACGAGCCCACA 3501 

GCTGG TG TGG AC CCTT ACTCC CGC AGGGG AAT ATGGG AGCTG CTG CTG AAATAC CG AC AA 3180 

II II llllllll IIIII IIIIIIIIIIIIIIIII II IIIII llllllll III 
G CCGGGGTGG ACC CGT AC T CT CGC AGGGG AAT ATGGG AACT C CTG CT AAAAT ACCGG C AA 3561 

3181 GGCCGCACCATTATTCTCTCTACACACCACATGGATGAAGCGGACGTCCTGGGGGACAGG 324 0 

1 1 1 1 1 1 1 1 1 F 1 1 1 1 1 I IIIIIIIIIIIIIIIII IIIII III IIII llllllll 



Db 


3562 


Qy 


3241 


Db 


3622 


Qy 


3301 


Db 


3682 


Qy 


3361 


Db 


3742 


Qy 


3421 


Db 


3802 


Qy 


3481 


Db 


3862 


Qy 


3541 


Db 


3922 


Qy 


3601 


Db 


3982 


Qy 


3661 


Db 


4042 


Qy 


3721 


Db 


4102 


Qy 


3781 


Db 


4162 


Qy 


3841 


Db 


4222 


Qy 


3901 


Db 


4282 


Qy 


3961 


Db 


4342 


Qy 


4021 


Db 


4402 


Qy 


4081 


Db 


4462 


Qy 


4141 


Db 


4522 


Qy 


4201 


Db 


4582 


Qy 


4261 



3562 GG C CG C ACC ATT ATTTTGTCT AC AC AC CAC ATGG ACG AAGCTG AC AT C CTTGGGG AC AG A 3621 
ATTGCCATCATCTCCCATGGGAAGCTGTGCTGTGTGGGCTCCTCCCTGTTTCTGAAGAAC 3300 

milium iiiiiiiiiiiiiiiii iiiiiiiiiiiiiiiiiiiii mi in 

ATTGCCATCATTTCCCATGGGAAGCTGTGTTGTGTGGGCTCCTCCCTGTTTTTGAAAAAC 3681 
CAGCTGGGAACAGGCTACTACCTGACCTTGGTCAAGAAAGATGTGGAATCCTCCCTCAGT 3360 

III IIIIIII II IIIII MINI 1 1 II MIIMMII1! 1 1 1 1 1 IIIIIIIII 

CAGTTGGG AACGGGTTACTATCTGACCCTGGTTAAGAAAGATGTGGAATCGTCCCTCAGT 3741 
TCCTGCAGAAACAGTAGTAGCACTGTGTCATACCTGAAAAAGGAGGACAGTGTTTCTCAG 3420 

IIIIIIIIIIIIIIIII IIIII IIIII I M i illl IMM 1 1 II M II II II M I 

TCCTGCAGAAACAGTAGCAGCACCGTGTCTTGTCTGAAAAAGGAGGACAGTGTTTCTCAG 3801 
AG C AGTTC TG ATG CTGG C CTGGGC AG CGAC C ATG AG AGTG AC ACG CTG AC C ATCG ATGTC 3480 

Ml II Mill II 1 1 1 1 MMMMI II IMMI I IMIIIIIIIIIMMIIIIIIII 

AG C AGTT CTG ATG C TGG C C TGGGC AG CGAC C ATG AAAGTG AC ACG CTG AC C ATCG ATGTC 3861 
TCTGCTATCTCCAACCTCATCAGGAAGCATGTGTCTGAAGCCCGGCTGGTGGAAGACATA 354 0 

iiiiminiiniiiiiiiiiiinii iiiniiiniiiiiiiniin inn 

TCTGCTATCTCCAACCTCATCAGGAAGCACGTGTCTGAAGCCCGGCTGGTGGAGGACATT 3 921 
GGG C ATG AG CTG ACCT ATGTG CTG C C ATATG AAGC TGCT AAGG AGGG AG C CTTTGTGG AA 3600 

iiiii mi mm iiii iiiiMii n inn n imiiiinnniiniii 

GGG C ACG AGCTG ACCT ATGTG CTG CCGTACGAAGCCGCG AAGG AGGG AG CCTTTGTGGAA 3 981 
C TCTTT C ATG AG ATTG ATG AC CGG CT CTC AG ACCTGGGC ATTTCT AGTT ATGG CAT C T C A 3660 

IIIII i IMMI II MM hi MM I Ml IM M I! II I || IIIMIIIMIIM 

CTCTT C C ATG AG AT TG ATG AC CGG CT CTC AG ACCTGGGC ATCT C C AGTT ATGG CAT C T CG 4 041 
GAGACGACCCTGGAAGAAATATTCCTCAAGGTGGCCGAAGAGAGTGGGGTGGATGCTGAG 3 72 0 

iiiii iiimmiiiiiiiiiiiiii inn iiiiiiii 1 1 1 . 1 1 m ' 1 1 1 1 1 

G AG AC C ACC CTGG AAG AAAT ATTC CT C AAAGTGG CTG AAG AG AG CGGGG TGG ATG CTGAG 4101 
ACCTCAGATGGTACCTTGCCAGCAAGACGAAACAGGCGGGCCTTCGGGGACAAGCAGAGC 3780 

iiiiiiiiiiMii Miiiiiiiiiiiimm iiiiiiiiiiiiiiiiiiiiiiii 

ACCTCAGATGGTACTTTGCCAGCAAGACGAAACAGACGGGCCTTCGGGGACAAGCAGAGC 4161 
TGTCTTCGCCCGTTCACTGAAGATGATGCTGCTGATCCAAATGATTCTGACATAGACCCA 384 0 

inn i in n ii mimiimi mill mn iiiiiiii iiiiiii 

TGTCTGCACCCATTTACGGAAGATGATGCTGTTGATCCCAATGACTCTGACATAGACCCA 4221 
G AATC C AG AG AG AC AG ACTTG CT C AG TGGG ATGG ATGG C AAAGGG T C C T ACC AGG TG AAA 3900 

iiiiiiii iiiii mi iiimiiiiiimi iiiiiiii iiiiiiiii mi 

G AATC C AGGG AG ACCG AC C TG CT CAGTGGG ATGG ACGG C AAAGG CT C C T ACC AG CTG AAG 4 2 81 
GGCTGGAAACTTACACAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGACTGCTAATTGCC 3960 

1 1 1 1 1 1 : 1 1 1 i ii Miiiiiiiiiiiiiiimiiiiimiiii iiiii iinii 

GGCTGGAAACTCACCCAGCAACAGTTTGTGGCCCTTTTGTGGAAGAGGCTGCTGATTGCC 4 341 
AG ACGG AGTCGG AAAGG ATTTTTTGCTCAGATTGTCTTGCC AG CTGTGTTTGTCTGC ATT 4 02 0 

llllllll IIIII II II IIIIIIIIIMIIII MIMIMII IIIII MUM 

AGACGGAGCCGGAAGGGTTTCTTTGCTCAGATTGTCCTGCCAGCTGTCTTTGTTTGCATT 4401 
GCCCTTGTGTTCAGCCTGATCGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 4 080 

IIIII II IIIIIIMII I III I II II MMMIMIMMM MIMI Illl I II 

GCCCTGGTCTTCAGCCTGATTGTGCCACCCTTTGGCAAGTACCCCAGCCTGGAACTTCAG 4461 
CCCTGGATGTACAACGAACAGTACACATTTGTCAGCAATGATGCTCCTGAGGACACGGGA 4140 

IIIIIIMII II II IIIII lllllllllll MIMIMII IIIIIII III 

CCCTGG ATGT AT AATG AG C AGT AT AC ATTTG TC AGT AATG ATG CT CCCG AGG AC ATGGG C 4 521 
ACCCTGGAACTCTTAAACGCCCTCACCAAAGACCCTGGCTTCGGGACCCGCTGTATGGAA 4 200 

Illl IIIIIII I II II II llllllll II IIIII 1 1 M 1 1 Ml 1 1 1 1 II ! M 

AC CC AGG AACT CCTG AATG CTCTG AC C AAAG ATCC AGGCTTTGGG ACC CG CTG T ATGG AA 4 581 

GG AAAC C C AAT CCC AG AC ACG C C CTG C C AGG C AGGGG AGG AAG AGTGG AC C ACTG C C C C A 4 260 

IIIIIIIIIIIIIIIII II II III III llllllll II IIIIIII III 
GG AAAC CC AATCCC AG AT ACC C CTTG CTTGG CTGGGG AGG AGG ACTGG ACC AT C AGCCCC 4 641 

GTTCCCCAGACCATCATGGACCTCTTCCAGAATGGGAACTGGACAATGCAGAACCCTTCA 4320 




II lllllll Mil II II 1 1 II II II 1 1 1 1 1 1 1 MINIM Ml llllll! Ill 

Db 4 64 2 GT CC CCC AG AG C ATCGTGG AC CTCTTC C AG AATGG AAACTGG ACC ATG AAG AAC CC CT C A 4 701 

Qy 4 321 C CTG C ATG C C AGTGT AG C AGCG AC AAAATC AAG AAG ATG CTG CCTGTGTG TC C CC CAGGG 4 3 80 

inn ii mi i iiiiiii iiiiiiiiiiiiiiiiiiiiini iiiiiiiiiiiii 

Db 4 702 CCTGCGTGCCAGTGTAGCAGTGACAAAATCAAGAAGATGCTGCCTGTGTGTCCCCCAGGG 4 761 

Qy 4381 GCAGGGGGGCTGCCTCCTCCAC 44 02 

I I I I II I I I I I I I I Mill I 
Db 4 762 GC AGGGGGGCTGCCACCTCCTC 4783 



SEQ ID NO: 2 



Result Query- 
No. Score Match Length DB ID Description 



1 


11469 


100, 


.0 


2201 


21 


AAY79380 


Human ATP binding 


2 


11469 


100. 


.0 


2201 


22 


AAE13021 


Human ATP binding 


3 


11469 


100. 


.0 


2201 


22 


AAM50227 


Human ATP binding 


4 


11469 


100. 


.0 


2201 


23 


ABP65164 


Hypoxia- regulated 


5 


11469 


100. 


.0 


2261 


22 


AAE13022 


Human ATP binding 


6 


11469 


100. 


.0 


2261 


22 


AAM50228 


Human ATP binding 


7 


11469 


100. 


.0 


2261 


22 


AAU02176 


Human ABC1 . Homo 


8 


11469 


100. 


.0 


2261 


22 


AAU02177 


Human ABC1 mutant 


9 


11469 


100. 


.0 


2261 


23 


ABB83111 


Human ABCA1 transp 


10 


11469 


100. 


.0 


2261 


23 


ABP52092 


Homo sapiens ABC t 


11 


11469 


100. 


.0 


2261 


23 


AAE23000 


Human ABC1 full-le 


12 


11468 


100. 


.0 


2261 


23 


ABB83117 


Polymorphic human 


13 


11468 


100. 


.0 


2261 


23 


ABB83124 


Polymorphic human 


14 


11466 


100. 


.0 


2261 


22 


AAU02183 


Human ABC1 mutant 


15 


11466 


100. 


.0 


2261 


22 


AAU02188 


Human ABC1 mutant 


16 


11466 


100 . 


. 0 


2261 


23 


ABB83115 


Polymorphic human 


17 


11466 


100 . 


. 0 


2261 


23 


ABB83116 


Polymorphic human 


18 


11466 


100 , 


.0 


2261 


23 


ABB83119 


Polymorphic human 


19 


11466 


100. 


.0 


2261 


23 


ABB83122 


Polymorphic human 


20 


11466 


100, 


.0 


2261 


23 


ABB83123 


Polymorphic human 


21 


11465 


100, 


,0 


2261 


22 


AAU02189 


Human ABC1 mutant 


22 


11464 


100. 


.0 


2261 


23 


ABB83121 


Polymorphic human 


23 


11463 


99. 


,9 


2261 


22 


AAU02181 


Human ABC1 mutant 


24 


11462 


99, 


.9 


2261 


22 


AAM78550 


Human protein SEQ 


25 


11462 


99. 


.9 


2263 


22 


ABB11956 


Human ABCA1 homolo 


26 


11462 


99. 


. 9 


2263 


22 


AAM79534 


Human protein SEQ 


27 


11461 


99, 


.9 


2261 


22 


AAU02182 


Human ABC1 mutant 


28 


11461 


99. 


.9 


2261 


22 


AAU02186 


Human ABC1 mutant 


29 


11459 


99. 


.9 


2261 


23 


ABB83118 


Polymorphic human 


30 


11458 


99. 


.9 


2261 


23 


ABB83120 


Polymorphic human 


31 


11440 


99. 


. 7 


2261 


21 


AAB38082 


Human ABC1 cholest 


32 


11440 


99, 


. 7 


2261 


22 


AAB71749 


Human ABC1 protein 


33 


11440 


99. 


. 7 


2261 


24 


ABU11899 


Human ATP -binding 


34 


11439 


99. 


.7 


2261 


23 


ABB81578 


Human ABC -A- 1-1 pr 


35 


11437 


99. 


.7 


2261 


21 


AAB38109 


Human ABC1 cholest 


36 


11437 


99. 


. 7 


2261 


21 


AAB38111 


Human ABC1 cholest 


37 


11437 


99. 


.7 


2261 


21 


AAB38114 


Human ABC1 cholest 


38 


11437 


99. 


. 7 


2261 


21 


AAB38115 


Human ABC1 cholest 


39 


11437 


99. 


. 7 


2261 


21 


AAB38117 


Human ABC1 cholest 


No. 


Score 


Match Length 


DB 


ID 


Description 


1 


6909 


60. 


2 


1375 


3 


US-08-665-259-26 


Sequence 26, Appl 


2 


6909 


60. 


.2 


1375 


3 


US-08-762-500-26 


Sequence 26, Appl 


3 


3129.5 


27. 


.3 


1457 


3 


US-08-665-259-27 


Sequence 27, Appl 


4 


3129.5 


27. 


.3 


1457 


3 


US-08-762-500-27 


Sequence 27, Appl 


5 


2635.5 


23. 


.0 


1684 


3 


US-08-665-259-25 


Sequence 25, Appl 


6 


2635.5 


23. 


.0 


1684 


3 


US-08-762-500-25 


Sequence 25 , Appl 


7 


2635.5 


23. 


.0 


1704 


3 


US-08-762-500-75 


Sequence 75, Appl 



8 


359 


3 


1 


315 


4 


US-09-328- 


352-4388 


Sequence 


4388, Ap 


9 


354 


3 


1 


332 


4 


US-09-107- 


532A-3752 


Sequence 


3752, Ap 


10 


352 .5 


3 


1 


335 


4 


US-09-252- 


991A-20837 


Sequence 


20837, A 


11 


352 


3 


1 


322 


4 


US-09-107- 


532A-4662 


Sequence 


4 662, Ap 


12 


346 


3 


0 


589 


4 


US-09-328- 


352-7592 


Sequence 


7592, Ap 


13 


336.5 


2 


9 


1280 


2 


US-08-583- 


276-19 


Sequence 


19, Appl 


14 


335.5 


2 


9 


291 


4 


US-09-107- 


532A-4205 


Sequence 


4 2 05, Ap 


15 


335.5 


2 


9 


1280 


4 


US-09-767- 


594-2 


Sequence 


2, Appli 


16 


335.5 


2 


9 


1280 


6 


5206352-4 




Patent No 


5206352 


17 


333 


2 


9 


1279 


2 


US-08-784- 


649A-2 


Sequence 


2, Appli 


18 


332 


2 


9 


402 


4 


US-09-107- 


532A-5360 


Sequence 


5360, Ap 


19 


330.5 


2 


9 


1280 


2 


US-08-752- 


447-2 


Sequence 


2, Appli 



RESULT 1 

US-08-665-259-26 

Sequence 26, Application US/08665259 
Patent No. 6028173 
GENERAL INFORMATION: 

APPLICANT: Landes, Gregory M . 
APPLICANT: Burn, Timothy C. 
APPLICANT: Connors, Timothy D. 
APPLICANT: Dackowski, William R. 
APPLICANT: Van Raay, Terence J. 
APPLICANT: Klinger, Katherine W. 

TITLE OF INVENTION: NOVEL HUMAN CHROMOSOME 16 GENES, 

TITLE OF INVENTION: COMPOSITIONS, METHODS OF MAKING AND USING SAME 
NUMBER OF SEQUENCES: 73 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: GENZYME CORPORATION 
STREET : One Mountain Road 
CITY : Framingham 
STATE: Massachusetts 
COUNTRY: United States of America 
ZIP : 01701 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC -DOS /MS -DOS 
SOFTWARE: Patent In Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/665 , 259 
FILING DATE: 17-JUN-1996 
CLASSIFICATION: 435 
ATTORNEY /AGENT INFORMATION: 
NAME: Dugan, Deborah A. 
REGISTRATION NUMBER: 37,315 
REFERENCE /DOCKET NUMBER: IG5-9.1 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (508) 872-8400 
TELEFAX: (508) 872-5415 
INFORMATION FOR SEQ ID NO: 26: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1375 amino acids 
TYPE: amino acid 
STRANDEDNESS : not relevant 
TOPOLOGY : unknown 
MOLECULE TYPE: protein 
US-08-665-259-26 

Query Match 60.2%; Score 6909; DB 3; Length 1375; 

Best Local Similarity 96.9%; Pred. No. 0; 

Matches 1332; Conservative 21; Mismatches 22; Indels 0; Gaps 0 
Qy 827 CMEEE PTHLKLGVS IQNLVKVYRDGMKVAVDGLALNFYEGQI TS FLGHNGAGKTTTMS I L 886 

1 1 1 1 1 1 1 1 !: 1 1 ! I 1 1 1 I I 1 1 1 1 1 1 1 1 1! 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 1 II ! I 1 1 1 I I I I i 

Db 1 CMEEEPTHLRLGVS I QNLVKVYRDGMKVAVDGLALNFYEGQITS FLGHNGAGKTTTMS I L 60 



Qy 



887 TGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 946 

II II II II III! 1 1 1 1 1 I II MM llllllll I! MIIMIIIII III! llll II 1 1 1 1 




Db 


61 


TGLFPPTSGTAYII/GKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 


120 


Qy 


94 7 


HVKAEMEQMALDVGLPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 


1006 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 ! 1 1 1 1 II 1 1 II 1 1 1 1 II 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


HVKAEMEQMALDVGLPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 


180 


Qy 


1007 


SRRGIWELLLKYRQGRTI ILSTHHMDEADVLGDRIAI ISHGKLCCVGSSLFLKNQLGTGY 


1066 




ii INI INI II II 1 1 1 1 III II II II Ihi Mil II II II II II 1 1 II 1 III II Mill 




Db 


181 


SRRGIWELLLKYRQGRTI I LSTHHMDEADILGDRIAI ISHGKLCCVGSSLFLKNQLGTGY 


240 


Qy 


1067 


YLTLVKKDVESSLSSCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 


1126 






II II 1 MMMI II 1 1 M 1 1 1 1 M 1; II II II II II ti in; II li M II II II MM 1 




Db 


241 


YLTLVKKDVESSLSSCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 


300 


Qy 


1127 


IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 


1186 




1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! II 1 1 1 t II 1 1 1 I 1 ! 1 1 1 1 

1 II II I 1 II ! 1 1 II 1 1 II 1 ! II II 1 II 1 1 1 II II II II 1 II i I 1 1 1 II I II II 1 1 1 1 II 1 




Db 


301 


IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 


360 


Qy 


1187 


IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETD 


1246 






llhll II II II II II 1 1 1 1 M 1 1 1 1 1 II IMMM IMIIII I 1 II II MM II Ml 




Db 


361 


IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETD 


420 


Qy 


1247 


LLSGMDGKGSYQVKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 

II IMI M II I i M 1 Mm 1 1 1 M 1 1 M 1 II II II 1 iM M II M II M II II Mil 


1306 


Db 


421 


LLSGMDGKGSYQLKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 


480 


Qy 


1307 


IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPD 


1366 






II II II M II 1 MM MM 1 1 1 M ! 1 M 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPD 


540 


Qy 


1367 


TPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPP 


1426 






III Nihil M 1 1 Ml M 1 1 1 1 1 M':MI M II II II IM ' IMM 1 II MM M 1 




Db 


541 


TPCLAGEEDWTISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPP 


600 


Qy 


1427 


PQRKQNTADILQDLTGRNI SDYLVKTYVQI IAKSLKNKIWVNEFRYGGFSLGVSNTQALP 


1486 






Mill MiMIM MM 1 1 1 1 1 IM 1 II II II 1 IM II M II II II II II MM M M 1 




Db 


601 


PQRKQKTADILQNLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNSQALP 


660 


Qy 


1487 


PSQEVNDATKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFL 


1546 






II Mill Mill III IhMIMhMIIII Ml MMuM IMI II MM II 1 




Db 


661 


PSHEVNDAI KQMKKLLKLTKDTS ADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAI SSFL 


720 


Qy 


1547 


NVINNAILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMS 


1606 






MIIMIIIIMIIIIIM 1! 1 1 M 1 II 1 1 M II M II M II 1 M 1 1 1 M 1 1 1 1 1 1 1 1 




Db 


721 


NVINNAILRANLQKGENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMS 


780 


Qy 


1607 


FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 


1666 






II II III IMM II II II M II MM 1 1 M 1 M M I! M II M II MM MM M IMM 1 




Db 


781 


FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 


840 


Qy 


1667 


KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 


1726 




M II II MM II II M II II II II II M 1 M 1 1 1 1 II MM II II II MM MM II III 




Db 


841 


KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATF 


900 


Qy 


1727 


VLELFTDNKLNNINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 


1786 






II II IMMI MM MM II ii II II M 1 1 M 1 1 1 II MM II II II 1 1 II II M II III 




Db 


901 


VLELFTNNKLNDINDILKSVFLIFPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSW 


960 


Qy 


1787 


DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGG 


1846 






MIIIMI II II II II II II II I! II II 1 1 ! 1 11 Ml 1 II M II II II II II II il 1 




Db 


961 


DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRILDGG 


1020 


Qy 


1847 


GQND I LE I KE LTK I YRR KRKP AVD R I C VG I P PG E C FG LLG VNG AG KS S T F KMLTGDTT VT 


1906 






II II II II II IMMI 1 1 MM II II MM 1 1 1 1 MM 1 II II II II M II II MM II 




Db 


1021 


GQNDILEIKELTKIYRRKRKPAVDRICIGIPPGECFGLLGVNGAGKSTTFKMLTGDTPVT 


1080 


Qy 


1907 


RGDAFLNRNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGE 


1966 






1 MM 1 1 M 1 II MM 1 1 MM MM II 1 Mill II III 1 III Ell 1 II 1 III II II II 




Db 


1081 


RGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGE 


1140 


Qy 


1967 


WA I RKLG L VK YG E K YAGNY SGGNKRKLS T AMAL I GG P P W F LD E PTTGMD P KARR F LWNC 


2026 



Illlllllllllllll I MM I II M II II Mill II Hi II II II M I II II II II II 

Db 1141 WAIRKLGLVKYGEKYASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNC 1200 

Qy 2027 ALSWKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 2 086 

II Ml II II 1 1 II Ml ! 1 1 M IMIII II II I M I II II 1 1 1 1 1 Ml 1 1 II I II li II II 

Db 1201 ALSIVKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 1260 

Qy 2087 GSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 2146 

Illllllllhllllllllll I MM II M 1 1 1 1 II M M 1 11 M I ! I Ml M 1 1 1 1 1 1 

Db 1261 GSNPDLKPVQEFFGLAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 1320 

Qy 214 7 SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2201 

IIIIIIIIIIIIIIMMIIIIIIIIIMIIIIIIIIMIIIMIIIIIIMIII 

Db 1321 SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 1375 



RESULT 2 

US-08-762-500-26 

Sequence 26, Application US/08762500 
Patent No. 6030806 
GENERAL INFORMATION : 

APPLICANT: Landes, Gregory M. 
APPLICANT: Burn, Timothy C. 
APPLICANT: Connors, Timothy D. 
APPLICANT: Dackowski, William R. 
APPLICANT: Van Raay, Terence J. 
APPLICANT: Klinger, Katherine w. 

TITLE OF INVENTION: NOVEL HUMAN CHROMOSOME 16 GENES, 

TITLE OF INVENTION: COMPOSITIONS , METHODS OF MAKING AND USING SAME 
NUMBER OF SEQUENCES: 83 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: GENZYME CORPORATION 
STREET: One Mountain Road 
CITY : Framingham 
STATE: Massachusetts 
COUNTRY; United States of America 
ZIP: 01701 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/762 , 500 
FILING DATE: 09-DEC-1996 
CLASSIFICATION : 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/665,259 
FILING DATE: 17-JUN-1996 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US96/1046 9 
FILING DATE: 17-JUN-1996 
ATTORNEY/AGENT INFORMATION : 
NAME: Dugan, Deborah A. 
REGISTRATION NUMBER: 37,315 
REFERENCE/DOCKET NUMBER: IG5-9.3 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (508) 872-8400 
TELEFAX: (508) 872-5415 
INFORMATION FOR SEQ ID NO: 26: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1375 amino acids 
TYPE: amino acid 
STRANDEDNESS : not relevant 
TOPOLOGY: unknown 
MOLECULE TYPE: protein 
US-08-762-500-26 



Query Match 



60.2%; Score 6909; DB 3; Length 1375; 



Best Local Similarity 96.9%; Pred. No. 0; 

Matches 1332; Conservative 21; Mismatches 22; Indels 0; Gaps 0; 

Qy 827 CMEEE PTHLKIjGVS IQNLVKVYRDGMKVAVDGLALNFYEGQI TSFLGHNGAGKTTTMS I L 886 

II Nil II h II I I I I I I I II II II II IMMIM II I I I II II II M II I III II I I I 

Db 1 CMEEE PTHLRLGVS I QNLVKVYRDGMKVAVDGLALNFYEGQI TSFLGHNGAGKTTTMS I L 6 0 

Qy 887 TGLFPPTSGTAYILGKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 946 

lllllllllllilllilllllhl lllllllllllll jllllllliMIIIMIIIIII 

Db 61 TGLFPPTSGTAYILGKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEK 120 

Qy 947 HVKAEMEQMALDVGLPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 1006 

ii ii mm ii 1 1 1 1 1 1 1 1 ii mm ii ii mi ii 1 1 1 mi ii ii ii ii in i ii i: I 

Db 121 HVKAEMEQMALDVGLPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPY 180 ' 

Qy 1007 SRRGIWELLLKYRQGRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGY 1066 

1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I 

Db 181 SRRGIWELLLKYRQGRTI I LSTHHMDEADILGDRIAI ISHGKLCCVGSSLFLKNQLGTGY 240 

Qy 106 7 YLTLVKKDVESSLSSCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 1126 

I II II II 1 1 M M M I II I II M I I Ml II II II 1 1 1 1 Ml II II II II II M II I M 

Db 241 YLTLVKKDVESSLSSCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNL 300 

Qy 112 7 IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 1186 

I MM I II Ml I MM MMM I M IMMI II II 1 1 1 1 1 IMMMI II M M >' I M 

Db 301 IRKHVSEARLVEDIGHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEE 360 

Qy 1187 IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETD 124 6 

I Ml I Ml I! 1 1 MM I M II MM M M M II M I MUM, 1 1 1 1 1 1 ( I II 1 1 1 1 I 

Db 361 IFLKVAEESGVDAETSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETD 420 

Qy 124 7 L LSGMDG KG S YQ VKG WKLT QQQ F VALLWKRLL I ARR S RKG F FAQ I VL P AVF VC I AL V F S L 1306 

Mil M II Ml M Ml I M M M II M M M II M M 1 1 1 1! I! II II II M IMM I M 

Db 421 LLSGMDGKGSYQLKGWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSL 4 80 

Qy 1307 IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPD 1366 

M M M M Ml I I M M M M II II M MM I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 

Db 4 81 IVPPFGKYPSLELQPWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPD 54 0 

Qy 1367 TPCQAGEEEWTTAPVPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPP 14 26 

III Nihil MllhhIllllllllhlimilNIIIINNNIlllllllll 

Db 541 TPCLAGEEDWTISPVPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPP 600 

Qy 14 27 PQRKQNTADILQDLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNTQALP 14 86 

INN I MM I M 1 1 1 1 1 1 1 M M II M II Mil Ml M M II Ml Mil M M M M I 

Db 601 PQRKQKTADILQNLTGRNISDYLVKTYVQIIAKSLKNKIWVNEFRYGGFSLGVSNSQALP 660 

Qy 1487 PSQEWDATKQMKKHLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFL 1546 

II INN INN IN MM II 1 1 MM II II 1 1 M MM 1 1 M M ! ! I M 1 1 1 M 

Db 661 PSHEVNDAIKQMKKLLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAISSFL 720 

Qy 154 7 NVINNAILRANLQKGENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMS 16 06 

M MM II MM M Ml II I N 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 N II 1 1 II 1 1 1 1 

Db 721 NVINNAILRANLQKGENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMS 780 

Qy 1607 FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 1666 

Ml IMM II 1 1 1 I Ml IMM! I I MM M II II 1 1 1 II IMI II II II II II M 1 1 1 I 

Db 781 FVPASFWFLIQERVSKAKHLQFISGVKPVIYWLSNFVWDMCNYWPATLVIIIFICFQQ 840 

Qy 1667 KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKI PSTAYWLTSVNLFIGINGSVATF 1726 

II MM IMI 1 1 1 II II II M II II II I I IMMM MIMI IMMI II II II II II 1 1 

Db 841 KSYVSSTNLPVLALLLLLYGWSITPLMYPASFVFKI PSTAYWLTSVNLFIGINGSVATF 900 

Qy 1727 VLELFTDNKLNNINDI LKSVFLI FPHFCLGRGLIDMVKNQAMADALERFGENRFVS PLSW 1786 

II MIMI M Ml 1 1 II II II II M II MM IMM IMIIMI II II II Ii II 1 1 Ml I 

Db 901 VLELFTNNKLNDINDI LKSVFLI FPHFCLGRGLIDMVKNQAMADALERFGENRFVS PLSW 960 



Qy 

Db 



1787 
961 



DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGG 1846 
IMI M II I I lllllll I I II II II II II II II II IN I I I I I I I I I I I I I I I I I I I I 
DLVGRNLFAMAVEGWFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRILDGG 1020 







• • 




Qy 


1847 


GQNDILEIKELTKIYRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVT 


1906 






1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 I 1 . 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • 1 1 I 1 1 1 I 1 1 II 
1 II 1 II 1 1 1 II 1 1 1 II 1 1 i 1 1 1 II 1 1 1 : 1 1 1 1 1 II 1 1 II II 1 1 1 M 1 • 1 1 1 1 M 1 1 1 II 




Db 


1021 


GQNDILEIKELTKIYRRKRKPAVDRICIGIPPGECFGLLGVNGAGKSTTFKMLTGDTPVT 


1080 


Qy 


1907 


RGDAFLNRNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGE 


1966 






i i i i i i t I I I I I I I I i I I I I I I I I I I I I t I 1 I I I 1 I I t I t t 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 
1 1 I I I 1 I : I 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


1081 


RGDAFLNKNSILSNIHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGE 


1140 


Qy 


1967 


WA I RKLG L VK YG E K Y AGNY S GGN KR KL S T AMAL I GG P PWFLD E PTTGMD P KARR F LWN C 


2026 




i i ) i i i i t i i i i i i i i 1 I t I I t I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 1 I 1 I 1 1 1 I 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


1141 


W A I RKLG L VKYG E K YAS N Y S GGN KRKL S T AMAL I GG P P W F LD E PTTGMD P KARR F LWNC 


1200 


Qy 


2027 


ALSWKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 


2086 






lit i i i l l l l ll l l l l l l l l l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 I 1 : I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1201 


ALSIVKEGRSWLTSHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIA 


1260 


Qy 


2087 


GSNPDLKPVQDFFGLAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 


2146 






i i i l l t l l l l l l l l l l l l l l l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1261 


GSNPDLKPVQEFFGLAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDY 


1320 


Qy 


2147 


SVSQTTLDQVFVNFAKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2201 








IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 




Db 


1321 


S VSQTTLDQVFVNFAKDQSDDDHLKDLS LHKNQTWDVAVLTS FLQDEKVKES YV 1375 





SUMMARIES 

% 



Result 
No. 


Score 


Query- 
Match 


Length DB 


ID 


Description 


1 


10906 


95 


.1 


2201 


2 


A54774 


ATP binding casset 


2 


3338.5 


29 


. 1 


1529 


2 


A59189 


ATP-binding casset 


3 


3129 . 5 


27 , 


.3 


1472 


2 


B54774 


ATP binding casset 


4 


2638.5 


23 


.0 


1704 


2 


S71363 


probable ATP-bindi 


5 


2635.5 


23 


.0 


1704 


2 


A59188 


ATP-binding casset 


6 


2061 


18 , 


.0 


1802 


2 


T33783 


hypothetical prote 


7 


1975 


17. 


.2 


1816 


2 


A84845 


probable ABC trans 


8 


1854.5 


16, 


.2 


1447 


2 


T15200 


hypothetical prote 


9 


1792 


15. 


.6 


1317 


2 


C88925 


protein F33E11.4 [ 


10 


1535.5 


13 . 


.4 


1758 


2 


F88559 


protein C48B4.4b [ 


11 


1528.5 


13 . 


.3 


1704 


2 


T42749 


ATP-binding casset 


12 


1526 


13 . 


.3 


1767 


2 


S60124 


transport protein 


13 


1393 


12 . 


.1 


1246 


2 


T00826 


hypothetical prote 


14 


1150.5 


10. 


.0 


1564 


2 


T27121 


hypothetical prote 


15 


1006.5 


8. 


,8 


373 


2 


T47150 


hypothetical prote 


16 


1005.5 


8. 


.8 


1431 


2 


T22748 


hypothetical prote 


17 


846 


7. 


,4 


269 


2 


T46467 


hypothetical prote 


18 


786 .5 


6. 


.9 


1011 


2 


T07712 


probable ABC-type 


19 


695 


6. 


.1 


900 


2 


T07717 


probable ABC -type 



RESULT 1 
A54774 

ATP binding cassette transporter ABC1 - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 05-Apr-1995 #sequence_revision 05-Apr-1995 #text_change 02-Feb-2001 
C; Accession: A54774 

R;Luciani, M.F.; Denizot, F . ; Savary, S.; Mattei, M.G.; Chimini, G. 
Genomics 21, 150-159, 1994 

A; Title: Cloning of two novel ABC transporters mapping on human chromosome 9. 
A;Reference number: A54774; MUID : 94375008 ; PMID:8088782 
A;Accession: A54774 
A;Molecule type: mRNA 
A; Residues : 1-2 201 <LUC> 

A; Cross-references: GB:X75926; NID:g495256; PIDN : CAA53530 . 1 ; PID:g495257 

C;Superf amily : unassigned ATP-binding cassette proteins; ATP-binding cassette homology 

C;Keywords: ATP; duplication; nucleotide binding; P-loop 

F; 856-104 7/Domain : ATP-binding cassette homology <ABC1> 

F; 873 - 880/Region : nucleotide -binding motif A (P-loop) 



F; 1869-2060/Domain: ATP-binding cassette homology <ABC2> 
F;1886-1893/Region: nucleotide -binding motif A (P-loop) 



Query Match 95.1%; Score 10906; DB 2; Length 2201; 

Best Local Similarity 94.8%; Pred. No. 0; 

Matches 2087; Conservative 54; Mismatches 60; Indels 0; Gaps 0; 



Qy 


1 


MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVARLFSDARRLLLYSQKDT 


b U 




lllllllllllllllllllllllllllllllllllllllilMIIIIMII IIIMI 




Db 


1 


MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVSRLFSDAQRLLLYSQRDT 


60 


Qy 


61 


SMKDMRKVLRTLQQIKKSSSNLKLQDr LVDNETr btjr LYrLNLbbi'K.b 1 VUJsJnLtKAUVlLirl 


ion 




hill MM hill M II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 IIMIhllll = h • 1 1 




Db 


61 


SIKDMHKVLRMLRQIKHPNSNLKLQDFLVDNETFSGFLQHNLSLPRSTVDSLLQXNVGLQ 


120 


Qy 


121 


KVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPIL 

MMIIIMM lllllll Ihlllll III lllllhll lllllll IMIilh: 


180 


Db 


121 


KVFLQGYQLHLASLCNGSKLEEIIQLGDAEVSALCGLPRKKLDAAERVLRYNMDILKPW 


180 


Qy 


181 


RTI^STSPFPSKEIjAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTQI 


24 0 




II 1 1 1 1 - 1 1 1 1 1 II III lllllll = 1 M 1 1 II II 1 1 II M 1 1 II II II 1 




Db 


181 


TKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTKSWSDMRQEVMFLTNVNSSSSSTQI 


240 


Qy 


241 


YQ A V SRI VCGH PEGGGLKIKS LNW Y E DNN YKAL FGGNGT E E DAE T F YDN S TT P YCNDLMK 


300 






1 1 1 1 1 1 II 1 1 II 1 1 1 M 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 M II II 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 * 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 * 1 1 1 1 1 1 




Db 


241 


YQ A VS R I VCGH P EGGG L K I KS LNW Y E DNN YKAL FGGNNT E E D VDT F YDN S TT P YCND LMK 


300 


Qy 


301 


NLESS PLSRI IWKALKPLLVGKI LYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 


360 




1 II 1 1 1 M 1 1 M 1 1 M M M 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 II 1 II II 1 M 




Db 


301 


NLESS PLSRI IWKALKPLLVGKI LYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 


360 


Qy 


361 


PKIWTFMENSQEMDLVRMLLDSRDNDHFWEQQLDGLDWTAQDIVAFLAKHPEDVQSSNGS 


4 2 0 






h 1 1 II 1 1 1 II II 1 1 1 1 1 1 1 II II 1 1 1 h 1 1 1 1 1 1 M II h 1 1 1 1 h II 1 M 1 III 
i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 iiiiiii ii ii 1 1 1 1 1 * 1 1 1 1 1 iii 




Db 


361 


P Q I WT F MEN S QE MD L VRTLLD S RGNDQ FW E QKLDGLD WT AQD I MAFLAKN P E D VQ S PNG S 


420 


Qy 


421 


VYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTG 


4 80 






1 1 1 1 1 1 1 1 i 1 1 II 1 h 1 1 1 1 1 1 M 1 M 1 1 1 II 1 III M M 1 M II II 1 1 II 1 II 1 1 M 




Db 


421 


VYTWREAFNETNQAIQTISRFMECVNLNKLEPIPTEVRLINKSMELLDERKFWAGIVFTG 


480 


Qy 


481 


ITPGSIELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDW 


54 0 




III hMMIMMMMMMMMIMIMMMIMMMMMMMMIIMM 




Db 


481 


ITPDSVELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDW 


540 


Qy 


541 


EQAI IRVLTGTEKKTGVYMQQMPYPCYVDDIFLRVMSRSMPLFMTLAWI YSVAVI IKGIV 


600 






1 M 1 1 1 II 1 1 M II II II M M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 M 1 II 1 II 1 II 




Db 


541 


EQAI I RVLTGSEKKTGVYVQQMPYPCYVDDI FLRVMSRSMPLFMTLAWI YSVAVI IKS IV 


600 


Qy 


601 


YEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 


660 




II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 M h 1 II 1 II II 1 M 1 1 1 1 1 1 1 1 M M 1 1 II II 1 II 1 
iiiiiiiiiiiiiiiiii iiiiiii 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


YEKEARLKETMRIMGLDNGILWFSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 


660 


Qy 


661 


FLSVFAWTI LQCFLISTLFSRANLAAACGGI I YFTLYLPYVLCVAWQDYVGFTLKI FAS 


720 






M II 1 h 1 1 II 1 1 1 II II 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 II 1 1 M M 1 M II 1 1 1 1 h M 1 1 1 1 
1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 




Db 


661 


FLSVFAMVTI LQCFLI STLFSRANLAAACGGI I YFTLYLPYVLCVAWQDYVGFSIKI FAS 


720 


Qy 


721 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTW 


780 






1 II 1 II 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 h 1 1 1 1 1 1 1 1 1 1 II 1 M 1 




Db 


721 


LLSPVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTW 


780 


Qy 


781 


YIEAVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVS 


840 






IMIMMMIMMIMMIMMIMI IIMIIIhll - II 1 1 1 1 II 1 II h 1 II 1 




Db 


781 


YIEAVFPGQYGIPRPWYFPCTKSYWFGEEIDEKSHPGSSQKGVSEICMEEEPTHLRLGVS 


840 


Qy 


841 


IQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYIL 


900 




IIMMMIMMMMMMMMIMMMMMIIMMMIMMIMMMIMI 




Db 


841 


IQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYIL 


900 


Qy 


901 


GKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 


960 






MllllllhlMMMIIIIIIMIIIIIMIIIMIIIMMIIIIMMMMIIII 




Db 


901 


GKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 


960 



Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 


Qy 


1801 


Db 


1801 


Qy 


1861 


Db 


1861 


Qy 


1921 


Db 


1921 


Qy 


1981 


Db 


1981 



LPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 

II Ml IMMIIMMMIM IMMMil MM Ml Mill MM MMMIMMI 

LPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKWILDEPTAGVDPYSRRGIWELLLKYRQ 



1020 



1020 



GRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 1080 

I II II II II II MM :| 1111 II II II II 1 1 1 1 1 1 1 1 1 1 1 II II MM II M II Mil M 

GRTIILSTHHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 1080 

SCRNSSSTVSYLKKEDSVSQSSSDAGU3SDHESDTLTIDVSAISNLIRKHVSEARLVEDI 114 0 

IMIIIMII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 !! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

SCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 114 0 

GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 1200 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1! M 1 1 1 1 1 M 1 

GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGISSYGISETTLEEIFLKVAEESGVDAE 1200 

TSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGSYQVK 1260 

1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 i 1 1 I M MM M M II II II II MM II M M 

TSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 1260 



GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 

IM M M M M II II 1 1 II I IMM II M M M M M M M Ml II M II IMMMI II 

GWKLTQQQFVALLWKRLLIARRSRKGFFAQIVLPAVFVCIALVFSLIVPPFGKYPSLELQ 



1320 



1320 



PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPIPDTPCQAGEEEWTTAP 1380 

lllllllllllllll III II lllll I III I III I III I MM Mil I lllhll : I 

PWMYNEQYTFVSNDAPEDMGTQELLNALTKDPGFGTRCMEGNPIPDTPCLAGEEDWTISP 1380 

V PQT I MD LFQNGNWTMQN PSPACQCSSDKI KKML P VC P PG AGG L P P PQRKQNT AD I LQD L 1440 

![ I H = 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 M N 1 1 1 i 1 1 1 1 1 = I 

V PQ S I VD LFQNGNWTM KN PSPACQCSSDKI KKML P VC P PG AGG L P P PQR KQKT AD I LQN L 144 0 
TGRNI SDYLVKTYVQI I AKSLKNKI WVNEFRYGGFSLGVSNTQALPPSQEVNDATKQMKK 1500 

1 1 1 1 1 1 1 1 E 1 1 1 1 i f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 i I lllll lllll 

TGRN I SD YL VKT YVQ I I AKS L KN K I WVNE FR YGG F S LG VS N S Q AL P P S H E VND A I KQMKK 1500 

HLKLAKDS SADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAI S SFLNVINNAI LRANLQK 1560 

III IMMMMMIMM Ml IMM M M ! Ml MM I M II M II 1 1 M II MM 

LLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAISSFLNVINN 1560 

GENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMSFVPASFWFLIQER 1620 

lllll IM MM II 1 1 M IMMI II IIIMMIIIIIIMIIIIIIIIIMIIIIII 

G EN PS Q YG I T A FNH P LN LT KQQLS E V ALMTT S VD VL VS I C V I F AM S F V P AS FWFL I QE R 1620 

VSKAKHLQFI SGVKPVI YWLSNFVWDMCNYWPATLVI I I FICFQQKSYVSSTNLPVLAL 1680 

I M Mil IMMI II I I II II IMMI 1 1 1 1 1 M I M M I M II II M IMMI || IM 

VSKAKHLQFI SGVKPVI YWLSNFVWDMCNYWPATLVI I I FICFQQKSYVSSTNLPVLAL 1680 

LLLLYGWSITPLMYPASFVFKIPSTAYWLTSVNLFIGINGSVATFVLELFTDNKLNNIN 1740 

I II IM MM II I II 1 1 1 1 II IMMI M M M M M M M I II IMMI IMi || MM 

L LL LYG W S I T P LM Y PASFVFKIPS T A YWLT S VNL F I G I NG S VAT F VLE L FTNN KLND I N 1740 

DILKSVFLI FPHFCLGRGLI DMVKNQAMADALERFGENRFVS PLSWDLVGRNLFAMAVEG 1800 

I I MIM Mill! Ill 1 1 1 MIMI II I M 1 1 M IMM I 1 1 II II II II M II II M II 

DILKSVFLI FPHFCLGRGLI DMVKNQAMADALERFGENRFVS PLSWDLVGRNLFAMAVEG 1800 

WFFLITVLIQYRFFIRPRPVNAKLSPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 1860 

I III 1 1 1 III 1 1 1 MM I 1 1 i Ml I Ml II II I II IM 1 1 II II II II II II II MM 

W F F L I T VL IQYRFFIRPR P VKAKL P P LND E D ED VRR E RQR I LDGGGQND I L E I KE LT K I 186 0 

YRRKRKPAVDRICVGIPPGECFGLLGVNGAGKSSTFKMLTGDTTVTRGDAFLNRNSILSN 1920 

I I III II II II II M II I I II I I II II MM I I M li IM II I II I I II II IMI MM 

YRRKRKPAVDRICIGIPPGECFGLLGVNGAGKSTTFKMLTGDTPVTRGDAFLNKNSILSN 1920 

IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKVGEWAIRKLGLVKYGEK 1980 

I IMMI MM MM M IMI IMMI I II II 1 1 1 MIM III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

IHEVHQNMGYCPQFDAITELLTGREHVEFFALLRGVPEKEVGKFGEWAIRKLGLVKYGEK 1980 

YAGNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSWKEGRSWLT 204 0 

II 1 1 1 1 Ml I II II II II III 1 1 1 1 1 M II II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 II II 

YASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSI VKEGRS WLT 204 0 



Qy 2041 SHSMEECEALCTRMAIMWGRFRCI/3SVQHLKNRFGDGYTIWRIAGSNPDLKPVQDFFG 2100 

I III II Nil M 1 1 1 1 II Nil II 1 1 MUM Illllll II M II 1 1 1 1 1 II I II h II I 

Db 2 041 SHSMEECEALCTRMAIMWGRFRCLGSVQHLKNRFGDGYTIVVRIAGSNPDLKPVQEFFG 2100 

Qy 2101 LAFPGSVPKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2160 

Illllll 1 1 1 M II 1 1 Ml MM I MIMM IMIIIMI II II M I II II I M M II I . 

Db 2101 LAFPGSVLKEKHRNMLQYQLPSSLSSLARIFSILSQSKKRLHIEDYSVSQTTLDQVFVNF 2160 

Qy 2161 AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2201 

Ml III III 1 1 M II II I MM II I MM IM MM II III 

Db 2161 AKDQSDDDHLKDLSLHKNQTWDVAVLTSFLQDEKVKESYV 2201 



SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length DB 


ID 


Description 


1 


11466 


100, 


.0 


2261 


1 


ABC1_HUMAN 


095477 


homo sapien 


2 


10906 


95 . 


.1 


2261 


1 


ABCl_MOUSE 


P41233 


mus musculu 


3 


5689.5 


49, 


.6 


2273 


1 


ABCR HUMAN 


P78363 


homo sapien 


4 


4131 


36 , 


.0 


2436 


1 


ABC2_HUMAN 


Q9bzc7 


homo sapien 


5 


3989.5 


34 , 


.8 


2434 


1 


ABC2_MOUSE 


P41234 


mus musculu 


6 


2635.5 


23 , 


.0 


1704 


1 


ABC3_HUMAN 


Q99758 


homo sapien 


7 


1528.5 


13 , 


.3 


1704 


1 


CED7_CAEEL 


P34358 


caenorhabdi 


8 


411 


3 , 


.6 


330 


1 


DRRA_STRPE 


P32010 


streptomyce 


9 


380.5 


3 , 


.3 


343 


1 


NODI_RHISN 


P55476 


rhizobium s 


10 


366 


3 , 


. 2 


304 


1 


NODI_RHIS3 


P72335 


rhizobium s 


11 


347 


3 , 


.0 


308 


1 


YADG_ECOLI 


P36879 


escherichia 


12 


347 


3 , 


.0 


335 


1 


NDI2_RHIME 


Q8gnh6 


rhizobium m 


13 


344.5 


3, 


.0 


340 


1 


NODI_RHILO 


P23703 


rhizobium 1 


14 


343 .5 


3 , 


.0 


347 


1 


NODI_RHIGA 


P50332 


rhizobium g 


15 


335 .5 


2 , 


. 9 


1280 


1 


MDR1_HUMAN 


P08183 


homo sapien 


16 


331 .5 


2, 


. 9 


355 


1 


NDI1_RHIME 


052618 


rhizobium m 


17 


329.5 


2. 


.9 


578 


1 


YBHF_ECOLI 


P75776 


escherichia 


18 


327 


2 . 


.9 


894 


1 


YHIH_ECOLI 


P37624 


escherichia 


19 


327 


2 . 


.9 


1276 


1 


MDR3 MOUSE 


P21447 


mus musculu 



RESULT 2 
ABC1_M0USE 

ID ABC1_M0USE STANDARD; PRT; 2261 AA. 

AC P41233; 

DT 01-FEB-1995 (Rel. 31, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE ATP-binding cassette, sub- family A, member 1 (ATP-binding cassette 

DE transporter 1) (ATP-binding cassette 1) (ABC-1) . 

GN ABCA1 OR ABC1 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=DBA/2 ; TISSUE ^Macrophage ; 

RX MEDLINE=94375008; PubMed=8088782 ; 

RA Luciani M.F., Denizot F . , Savary S., Mattei M.-G., Chimini G.; 

RT "Cloning of two novel ABC transporters mapping on human chromosome 

RT 9 . " ; 

RL Genomics 21:150-159(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J ; 

RX MEDLINE=21251004; PubMed=11352567 ; 

RA Qiu Y . , Cavelier L., Chiu S., Yang X., Rubin E . , Cheng J.-F.; 

RT "Human and mouse ABCA1 comparative sequencing and transgenesis 




RT 


studies revealing 


novel regulatory sequences . 11 ; 


RL 


Genomics 


73 :66-76 (2001) . 




CC 


-!- FUNCTION: CAMP-DEPENDENT AND SULFONYLUREA- SENSITIVE ANION 


cc 


TRANSPORTER. KEY GATEKEEPER INFLUENCING INTRACELLULAR CHOLESTEROL 


cc 


TRANSPORT (BY 


SIMILARITY) . 


cc 


-!- TISSUE SPECIFICITY: WIDELY EXPRESSED IN ADULT TISSUES. HIGHEST 


cc 


LEVELS ARE FOUND IN PREGNANT UTERUS AND UTERUS. 


cc 


-!- DOMAIN: MULTIFUNCTIONAL POLYPEPTIDE WITH TWO HOMOLOGOUS HALVES, 


cc 


EACH 


CONTAINING AN HYDROPHOBIC MEMBRANE -ANCHORING DOMAIN AND AN 


cc 


ATP BINDING CASSETTE 


(ABC) DOMAIN. 


cc 


- ! - PTM : 


Phosphorylat ion 


on Ser-2054 regulates phospholipid efflux (By 


cc 


similarity) . 






cc 
cc 
cc 


-!- SIMILARITY: BELONGS TO THE ABC TRANSPORTER FAMILY. ABCA SUBFAMILY. 


This SWISS -PROT entry is 


copyright. It is produced through a collaboration 


cc 


between 


the Swiss Institute of Bioinf ormatics and the EMBL outstation - 


cc 


the European Bioinf ormatics Institute. There are no restrictions on its 


cc 


use by 


non-profit institutions as long as its content is in no way 


cc 


modified 


and this 


statement is not removed. Usage by and for commercial 


cc 


entities 


requires 


a license agreement (See http://www.isb-sib.ch/announce/ 


cc 
cc 

DR 


or send an email to license@isb-sib . ch) . 


EMBL; X75 926; CAA53 53 0.1; 


ALT_INIT . 


DR 


EMBL; AF287263; AAG39073 . 


1; ALT_INIT. 


DR 


MGD; MGI: 


99607; Abcal . 




DR 


GO; GO: 0008203; P 


: cholesterol metabolism; IDA. 


DR 


GO; GO: 0030301; P 


: cholesterol transport; IDA. 


DR 


InterPro; 


IPR003593; AAA_ 


ATPase . 


DR 


InterPro; 


IPR003439; ABC_ 


transporter . 


DR 


Pfam; PF00005; ABC tran; 


2 . 


DR 


ProDom; PD000006; 


ABC transporter; 2. 


DR 


SMART; SM00382; AAA; 2. 




DR 


PROSITE; 


PS00211; 


ABC_TRANS PORTER_l ; 1 . 


DR 


PROSITE; 


PS50893; 


ABC TRANSPORTER 2; 2. 


KW 


ATP-binding; Glycoprotein 


; Transmembrane; Transport; Phosphorylation. 


FT 


TRANSMEM 


26 


42 


POTENTIAL. 


FT 


TRANSMEM 


640 


656 


POTENTIAL. 


FT 


TRANSMEM 


690 


706 


POTENTIAL. 


FT 


TRANSMEM 


717 


733 


POTENTIAL . 


FT 


TRANSMEM 


749 


765 


POTENTIAL . 


FT 


TRANSMEM 


771 


787 


POTENTIAL . 


FT 


TRANSMEM 


1041 


1057 


POTENTIAL . 


FT 


TRANSMEM 


1351 


1367 


POTENTIAL. 


FT 


TRANSMEM 


1661 


1677 


POTENTIAL. 


FT 


TRANSMEM 


1708 


1724 


POTENTIAL. 


FT 


TRANSMEM 


1737 


1753 


POTENTIAL. 


FT 


TRANSMEM 


1775 


1791 


POTENTIAL . 


FT 


TRANSMEM 


1854 


1870 


POTENTIAL. 


FT 


NP_BIND 


933 


940 


ATP (POTENTIAL) . 


FT 


NP_BIND 


1946 


1953 


ATP (POTENTIAL) . 


FT 


M0D_RES 


1042 


1042 


PHOSPHORYLATION (BY PKA) (MAJOR) (BY 


FT 








SIMILARITY) . 


FT 


MOD_RES 


2054 


2054 


PHOSPHORYLATION (BY PKA) (MAJOR) (BY 


FT 








SIMILARITY) . 


FT 


CARBOHYD 


14 


14 


N- LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


98 


98 


N- LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


151 


151 


N- LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


161 


161 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


196 


196 


N- LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


244 


244 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


292 


292 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


337 


337 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


349 


349 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


400 


400 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


478 


478 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


489 


489 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


521 


521 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


820 


820 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


1144 


1144 


N-LINKED (GLCNAC. . .) (POTENTIAL) . 


FT 


CARBOHYD 


1294 


1294 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


1453 


1453 


N-LINKED (GLCNAC. . .) (POTENTIAL). 



FT 


C ARB OH YD 


14 99 


14 99 


N- LINKED (GLCNAC 






FT 


C ARB OH YD 


1504 


1504 


N-LINKED (GLCNAC 




) (POTENTIAL) 


FT 


CARBOHYD 


1637 


1637 


N-LINKED (GLCNAC 




) (POTENTIAL) 


FT 


CARBOHYD 


2044 


2044 


N-LINKED (GLCNAC 




) (POTENTIAL) 


FT 


CARBOHYD 


2238 


2238 


N-LINKED (GLCNAC 




) (POTENTIAL) 


FT 


CONFLICT 


1567 


1568 


MISSING (IN REF. 


2) . 




FT 


CONFLICT 


2024 


2024 


MISSING (IN REF. 


2) . 




SQ 


SEQUENCE 


2261 


AA; 254011 


MW; FAE62B21FD1D09F9 


CRC64 ; 


Query Match 




95.1%; 


Score 10906; DB 


1; 


Length 2261; 



Best Local Similarity 94.8%; Pred. No. 0; 
Matches 2087; Conservative 54; Mismatches 60; 



Indels 



0 ; Gaps 



Qy 


1 


Db 


61 


Qy 


61 


Db 


121 


Qy 


121 


Db 


181 


Qy 


181 


Db 


241 


Qy 


241 


Db 


301 


Qy 


301 


Db 


361 


Qy 


361 


Db 


421 


Qy 


421 


Db 


481 


Qy 


481 


Db 


541 


Qy 


541 


Db 


601 


Qy 


601 


Db 


661 


Qy 


661 


Db 


721 


Qy 


721 


Db 


781 


Qy 


781 


Db 


841 


Qy 


841 



MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVARLFSDARRLLLYSQKDT 60 

II hi II h II I Mil M il li M I II I Ml I I I II IMI I I Mill II U I II llhll 
MPSAGTLPWVQGIICNANNPCFRYPTPGEAPGWGNFNKSIVSRLFSDAQRLLLYSQRDT 120 

SMKDMRKVLRTLQQIKKSSSNLKLQDFLVDNETFSGFLYHNLSLPKSTVDKMLRADVILH 120 

hill I I I I hill :I I I I I I I I ! I! I I I I I I I I lllllhllll :h : I I 
SIKDMHKVLRMLRQIKHPNSNLKLQDFLVDNETFSGFLQHNLSLPRSTVDSLLQXNVGLQ 180 

KVFLQGYQLHLTSLCNGSKSEEMIQLGDQEVSELCGLPREKLAAAERVLRSNMDILKPIL 180 

IIIIIIIIMI lllllll Ihlllll III lllllhll IMIMI lllllih: 
KVFLQGYQLHLASLCNGSKLEEI IQLGDAEVSALCGLPRKKLDAAERVLRYNMDILKPW 24 0 

RTLNSTSPFPSKELAEATKTLLHSLGTLAQELFSMRSWSDMRQEVMFLTNVNSSSSSTQI 24 0 

inn i- inn ii iii iiiiiii :iiiiiiiiiimiiimiini 

TKLNSTSHLPTQHLAEATTVLLDSLGGLAQELFSTKSWSDMRQEVMFLTNVNSSSSSTQI 300 
YQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNGTEEDAETFYDNSTTPYCNDLMK 300 

iiiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiii mi hiiiiiiiiiiiiiii 

YQAVSRIVCGHPEGGGLKIKSLNWYEDNNYKALFGGNNTEEDVDTFYDNSTTPYCNDLMK 360 
NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 360 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIillllllllllllMIII 

NLESSPLSRIIWKALKPLLVGKILYTPDTPATRQVMAEVNKTFQELAVFHDLEGMWEELS 4 20 
P K I WT F M ENS Q E MDL VRMLLD S RDNDH F WE QQ LDGLDWT AQD I VA FLAKH P E D VQ S S NG S 4 20 

hlllllllMIIIIII Mill II lllhllllllllMhllllhllllll III 

PQIWTFMENSQEMDLVRTLLDSRGNDQFWEQKLDGLDWTAQDIMAFLAKNPEDVQSPNGS 4 80 
VYTWREAFNETNQAIRTISRFMECVNLNKLEPIATEVWLINKSMELLDERKFWAGIVFTG 4 80 

IIIIIIIIIIMIIhllimilllllMMI III IIIIIIIIIIMIIIIIIIIII 

VYTWRE A FN ETNQ A I QT I SR FM E C VN LNKL E P I PTEVRLINKSMELLDERKFWAGIVFTG 54 0 
ITPGSIELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDW 54 0 

III hi II M II I II II III! 1 1 III Nil 1 1 1 II II II 1 1 1 II M I 1 1 M II I! II!! 

ITPDSVELPHHVKYKIRMDIDNVERTNKIKDGYWDPGPRADPFEDMRYVWGGFAYLQDW 600 
EQAIIRVLTGTEKKTGVYMQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKGIV 600 

n ii i n ii in; i iinn in 1 1 n nil 1 1 1 ii ii ii 1 1 1 n ii n 1 1 1 ii m n 

EQAIIRVLTGSEKKTGVYVQQMPYPCYVDDIFLRVMSRSMPLFMTLAWIYSVAVIIKSIV 660 
YEKEARLKETMRIMGLDNSILWFSWFISSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 660 

ni 1 1 n 1 1 1 ! 1 1 1 i n i i n 1 1 1 mi n 1 1 1 ii ii in i iii ii ii 1 1 ni i in ii 

YEKEARLKETMRIMGLDNGILWFSWFVSSLIPLLVSAGLLWILKLGNLLPYSDPSWFV 720 
FLSVFAWTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFTLKIFAS 720 

1 1 1 1 1 n m 1 1 1 n 1 1 n 1 1 ii 1 1 1 ii 1 1 1 n 1 1 1 1 1 1 1 1 1 : 1 1 ii n i n i n n i n 

FLSVFAMVTILQCFLISTLFSRANLAAACGGIIYFTLYLPYVLCVAWQDYVGFSIKIFAS 780 
LLS PVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTSVSMMLFDTFLYGVMTW 780 

ii ii i ii ii ii ii ill ii ii ii i mi ii ii ii i M ii ii i ! niiin 1 1 1 1 1 ii in 

LLS PVAFGFGCEYFALFEEQGIGVQWDNLFESPVEEDGFNLTTAVSMMLFDTFLYGVMTW 84 0 
YIEAVFPGQYGIPRPWYFPCTKSYWFGEESDEKSHPGSNQKRISEICMEEEPTHLKLGVS 84 0 

1 1 1 iii 1 1 1 ii 1 1 1 1 1 1 1 1 ill 1 1 1 ii ii Miinihii n 1 1 1 1 ii ii 1 1 in i n 

YIEAVFPGQYGIPRPWYFPCTKSYWFGEEIDEKSHPGSSQKGVSEICMEEEPTHLRLGVS 900 
I QN L VKVYRDGM KVA VDG LALN F Y EGQ ITS F LGHNG AG KTTTMS I LTG L F P PT S GT A Y I L 900 

n 1 1 1 ii ii n 1 1 1 1 iii 1 1 1 1 n 1 1 1 1 1 ii n 1 1 1 1 1 1 1 1 ii ii 1 1 1 1 1 1 1 n ii ii i 



Db 


901 


Qy 


901 


Db 


961 


Qy 


961 


Db 


1021 


Qy 


1021 


Db 


1081 


Qy 


1081 


Db 


1141 


Qy 


1141 


Db 


1201 


Qy 


1201 


Db 


1261 


Qy 


1261 


Db 


1321 


Qy 


1321 


Db 


1381 


Qy 


1381 


Db 


1441 


Qy 


1441 


Db 


1501 


Qy 


1501 


Db 


1561 


Qy 


1561 


Db 


1621 


Qy 


1621 


Db 


1681 


Qy 


1681 


Db 


1741 


Qy 


1741 


Db 


1801 


Qy 


1801 


Db 


1861 


Qy 


1861 


Db 


1921 


Qy 


1921 



901 IQNLVKVYRDGMKVAVDGLALNFYEGQITSFLGHNGAGKTTTMSILTGLFPPTSGTAYIL 960 
GKDIRSEMSTIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 960 

llllilll hill llllillllllllllllllllllllllllllll lllllllllllll 

GKDIRSEMSSIRQNLGVCPQHNVLFDMLTVEEHIWFYARLKGLSEKHVKAEMEQMALDVG 1020 
LPSSKLKSKTSQLSGGMQRKLSVALAFVGGSKVVILDEPTAGVDPYSRRGIWELLLKYRQ 1020 

II I II II II I ! I II I 1 1 1 1 III! II I II II M III II 1 1 1 III 1 1 1 1 II II II 1 1 I II I 

LPPSKLKSKTSQLSGGMQRKLSVALAFVGGSKVVILDEPTAGVDPYSRRGIWELLLKYRQ 1080 
GRTIILSTHHMDEADVLGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 1080 

II 1 1 II 1 1 1 1 1 1 1 1 1 : 1 1 ! 1 1 MM 1 1 1 1 II 1 1 IM 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 i 1 1 1 1 1 1 1 

GRTIILSTHHMDEADILGDRIAIISHGKLCCVGSSLFLKNQLGTGYYLTLVKKDVESSLS 1140 
SCRNSSSTVSYLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 114 0 

Mllllllll 1 1 1 1 1 II M I MM 1 1 1 1 1 1 Ml II 1 1 1 M 1 1 1 II M M 1 1 1 1 M II M 

SCRNSSSTVSCLKKEDSVSQSSSDAGLGSDHESDTLTIDVSAISNLIRKHVSEARLVEDI 12 00 
GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGI SSYGISETTLEEI FLKVAEESGVDAE 1200 

MM MM M I IMMM M II II M I Mil Mill M 1 1 1 II || M I M m hMMI I 

GHELTYVLPYEAAKEGAFVELFHEIDDRLSDLGI SSYGISETTLEEI FLKVAEESGVDAE 12 60 
TSDGTLPARRNRRAFGDKQSCLRPFTEDDAADPNDSDIDPESRETDLLSGMDGKGS YQVK 1260 

MM II I I MMIIMM I i M I I ; I 1 1 1 I M II M I M II II M I M M I M M Ml 

TSDGTLPARRNRRAFGDKQSCLHPFTEDDAVDPNDSDIDPESRETDLLSGMDGKGSYQLK 1320 
GWKLTQQQ F VAL LWKRL L I ARR S RKG F FAQ I VL P A VF VC I AL VF S L I V P P FG K Y P S L ELQ 132 0 

I M I II 1 1 II MMMM I III M M I IMM I MMI ! M II II II IM M II Ml M I 

GWKLTQQQ F VAL LWKRL L I ARR S RKG F FAQ I VL P A VF VC I AL VF S L I V P P FG K Y P S L ELQ 1380 
PWMYNEQYTFVSNDAPEDTGTLELLNALTKDPGFGTRCMEGNPI PDTPCQAGEEEWTTAP 13 80 
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VPQTIMDLFQNGNWTMQNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQNTADILQDL 1440 
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VPQSIVDLFQNGNWTMKNPSPACQCSSDKIKKMLPVCPPGAGGLPPPQRKQKTADILQNL 1500 
TGRNI SDYLVKTYVQI IAKSLKNKI WVNEFRYGGFSLGVSNTQALPPSQEVNDATKQMKK 1500 

1 1 MM M IM MM I II I MMM 1 1 II MM I II Ml MM ll M I MM Mill 

TGRNI SDYLVKTYVQI IAKSLKNKI WVNEFRYGGFSLGVSNSQALPPSHEVNDAI KQMKK 1560 
HLKLAKDSSADRFLNSLGRFMTGLDTRNNVKVWFNNKGWHAISSFLNVINNAILRANLQK 1560 

Ml I M II I II Ml I M I : IM MM II 1 1 1 M Ml II II Mm Ml 1 1 1 M II M I 

LLKLTKDTSADRFLSSLGRFMAGLDTKNNVKVWFNNKGWHAI SSFLNVINNAI LRANLQK 1620 
GENPSHYGITAFNHPLNLTKQQLSEVAPMTTSVDVLVSICVIFAMSFVPASFWFLIQER 1620 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 M Ml 1 1 1 1 1 Ml 1 1 1 M II II 1 1 1! 

GENPSQYGITAFNHPLNLTKQQLSEVALMTTSVDVLVSICVIFAMSFVPASFWFLIQER 1680 
VS KAKHLQF I SGVKPVI YWLSNFVWDMCNYWPATLVI 1 1 F I CFQQKS YVS STNLPVLAL 1680 
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L LLL YGWS I T P LM Y PAS F VF K I P S T A YWLT S VN L F I G I NG S VAT F VL E L FTDN KLNN I N 174 0 
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I M Ml II III MMM Ml II Mi 1 1 II II II I III II M MM I II II II M I MM I 

DILKSVFLI FPHFCLGRGLIDMVKNQAMADALERFGENRFVSPLSWDLVGRNLFAMAVEG 1860 
WFFLITVLIQYRFFIRPRPVNAKLS PLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 1860 

1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 I Ml Ml MM MUM II II Mi II II Ml I Ml Ml 

WFFLITVLIQYRFFIRPRPVKAKLPPLNDEDEDVRRERQRILDGGGQNDILEIKELTKI 1920 
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YASNYSGGNKRKLSTAMALIGGPPWFLDEPTTGMDPKARRFLWNCALSIVKEGRSWLT 2100 
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SHSMEECEALCTRMAIMVNGRFRCLGSVQHLKNRFGDGYTIWRIAGSNPDLKPVQEFFG 2160 
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LAF PGS VLKE KH RNMLQ YQLP SSLSSLAR I FS I LSQSKKRLHIEDYSVSQTTLDQVFVNF 2220 
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llll IIMM II M I M II II II MM M II II Ml M M I 
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NEWS 


18 


May 


15 
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Field Availability (/FA) field enhanced in BEILSTEIN 
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TEMA: one FREE connect hour, per account, in 
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Data available for download as a PDF in RDISCLOSURE 
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Simultaneous left and right truncation added to PASCAL 
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FROSTI and KOSMET enhanced with Simultaneous Left and Right 
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NEWS 38 AUG 18 Simultaneous left and right truncation added to ANABSTR 

NEWS EXPRESS April 4 CURRENT WINDOWS VERSION IS V6.01a # CURRENT 
MACINTOSH VERSION IS V6.0b(ENG) AND V6.0Jb(JP), 
AND CURRENT DISCOVER FILE IS DATED 01 APRIL 2003 

NEWS HOURS STN Operating Hours Plus Help Desk Availability 

NEWS INTER General Internet Information 

NEWS LOGIN Welcome Banner and News Items 

NEWS PHONE Direct Dial and Telecommunication Network Access to STN 
NEWS WWW CAS World Wide Web Site (general information) 

Enter NEWS followed by the item number or name to see news on that 
specific topic. 

All use of STN is subject to the provisions of the STN Customer 
agreement. Please note that this agreement limits use to scientific 
research. Use for software development or design or implementation 
of commercial gateways or other similar uses is prohibited and may 
result in loss of user privileges and other penalties. 

************* STN Columbus *************** 
FILE 'HOME' ENTERED AT 15:17:50 ON 10 SEP 2003 
=> file medline biosis embase caplus 

COST IN U.S. DOLLARS SINCE FILE TOTAL 

ENTRY SESSION 

FULL ESTIMATED COST 0.21 0.21 

FILE 'MEDLINE' ENTERED AT 15:18:02 ON 10 SEP 2003 

FILE 'BIOSIS' ENTERED AT 15:18:02 ON 10 SEP 2003 
COPYRIGHT (C) 2 003 BIOLOGICAL ABSTRACTS INC. (R) 

FILE ' EMBASE ' ENTERED AT 15:18:02 ON 10 SEP 2003 

COPYRIGHT (C) 2003 Elsevier Science B.V. All rights reserved. 

FILE 'CAPLUS' ENTERED AT 15:18:02 ON 10 SEP 2003 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2 003 AMERICAN CHEMICAL SOCIETY (ACS) 

=> s schmitz gerd /au 

LI 369 SCHMITZ GERD 

=> s klucken jochen /au 

L2 19 KLUCKEN JOCHEN 

=> s atp (p) binding (p) cassette (p) gene 

L3 3913 ATP (P) BINDING (P) CASSETTE (P) GENE 

=> s abcal 

L4 103 0 ABCA1 

=> s atp (s) binding (s) cassette (s) gene 

L5 3077 ATP (S) BINDING (S) CASSETTE (S) GENE 

=> s atp (s) binding (s) cassette (s) gene (s) protein 

L6 2046 ATP (S) BINDING (S) CASSETTE (S) GENE (S) PROTEIN 

=> d 12 total ibib 
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2003356553 MEDLINE 
22745349 PubMed ID: 12861035 

High efficacy of clonal growth and expansion of adult 
neural stem cells. 

Wachs Frank-Peter; Couillard-Despres Sebastien; Engelhardt 
Maren; Wilhelm Daniel; Ploetz Sonja; Vroemen Maurice; 
Kaesbauer Johanna; Uyanik Goekhan; Klucken Jochen 
; Karl Claudia; Tebbing Johanna; Svendsen Clive; Weidner 
Norbert; Kuhn Hans-Georg; Winkler Juergen; Aigner Ludwig 
Volkswagen-Foundation Junior Group, University of 
Regensburg , Germany . 

LABORATORY INVESTIGATION, (2003 Jul) 83 (7) 949-62. 
Journal code: 0376617. ISSN: 0023-6837. 
United States 

Journal; Article; (JOURNAL ARTICLE) 
English 

Priority Journals 
200308 

Entered STN: 20030801 

Last Updated on STN: 20030813 

Entered Medline: 20030812 
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19 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. on STN 
2003 :366865 BIOSIS 
PREV200300366865 

High efficacy of clonal growth and expansion of adult 
neural stem cells. 

Wachs, Frank-Peter; Couillard-Despres, Sebastien; 
Engelhardt, Maren; Wilhelm, Daniel; Ploetz, Sonja; Vroemen, 
Maurice; Kaesbauer, Johanna; Uyanik, Goekhan; Klucken, 
Jochen; Karl, Claudia; Tebbing, Johanna; Svendsen, 
Clive; Weidner, Norbert; Kuhn, Hans-Georg; Winkler, 
Juergen; Aigner, Ludwig (1) 

(1) Department of Neurology, University of Regensburg, 

Franz-Josef -Strauss-Allee 11, D-93053, Regensburg, Germany: 

ludwig . aigner@klinik .uni- regensburg . de Germany 

Laboratory Investigation, (July 2003, 2003) Vol. 83, No. 7, 

pp. 949-962. print. 

ISSN: 0023-6837. 

Article 

English 
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LANGUAGE : 
SUMMARY LANGUAGE 



19 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. on STN 
2001:64994 BIOSIS 
PREV200100064994 

Genomic organization and characterization of the promoter 
of the human ATP-binding cassette transporter-Gl (ABCG1) 
gene . 

Langmann, Thomas; Porsch-Oezcueruemez , Mustafa; Unkelbach, 
Uwe; Klucken, Jochen; Schmitz, Gerd (1) 
(1) Institute for Clinical Chemistry and Laboratory 
Medicine, University of Regensburg, Franz- Josef -Strauss- 
Allee 11, 93042, Regensburg: gerd. schmitz@klinik.uni- 
regensburg.de Germany 

Biochimica et Biophysica Acta, (15 November, 2000) Vol. 

1494, No. 1-2, pp. 175-180. print. 

ISSN: 0006-3002. 

Article 

English 

English 
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TITLE : 
AUTHOR (S) : 

CORPORATE SOURCE: 

SOURCE : 



DOCUMENT TYPE: 
LANGUAGE : 
SUMMARY LANGUAGE: 



Identification of a novel human sterol -sensitive 
ATP-binding cassette transporter (ABCA7 . 
Kaminski, Wolfgang E.; Orso, Evelyn; Diederich, Wendy; 
Klucken, Jochen; Drobnik, Wolfgang; Schmitz, Gerd 
(1) 

(1) Institute for Clinical Chemistry and Laboratory 
Medicine, University of Regensburg, Franz- Josef -Strauss- 
Allee 11, D-93042, Regensburg Germany 

Biochemical and Biophysical Research Communications, (July 

5, 2000) Vol. 273, No. 2, pp. 532-538. print. 

ISSN: 0006-291X. 

Article 

English 

English 



L2 ANSWER 5 OF 19 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. on STN 
ACCESSION NUMBER: 2000:113448 BIOSIS 
DOCUMENT NUMBER: PREV200000113448 

TITLE: ABCG1 (ABC8) , the human homolog of the Drosophila white 

gene, is a regulator of macrophage cholesterol and 

phospholipid transport. 
AUTHOR (S) : Klucken, Jochen; Buechler, Christa; Orso, Evelyn; 

Kaminski, Wolfgang E.; Porsch-Oezcueruemez , Mustafa; 

Liebisch, Gerhard; Kapinsky, Michael; Diederich, Wendy; 

Drobnik, Wolfgang; Dean, Michael; Allikmets, Rando; 

Schmitz, Gerd (1) 

CORPORATE SOURCE: (1) Institute for Clinical Chemistry and Laboratory 

Medicine, University of Regensburg, Franz- Josef -Strauss- 

Allee 11, D-93042, Regensburg Germany 
SOURCE: Proceedings of the National Academy of Sciences of the 

United States of America, (Jan. 18, 2000) Vol. 97, No. 2, 

pp. 817-822. 

ISSN: 0027-8424. 
DOCUMENT TYPE: Article 
LANGUAGE : Engl i sh 

SUMMARY LANGUAGE: English 

L2 ANSWER 6 OF 19 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. on STN 
ACCESSION NUMBER: 2000:96559 BIOSIS 
DOCUMENT NUMBER: PREV2 000000 96559 

TITLE: Regulation of scavenger receptor CD163 expression in human 

monocytes and macrophages by pro- and antiinflammatory 
stimuli . 

AUTHOR (S) : Buechler, Christa; Ritter, Mirko; Orso, Evelyn; Langmann, 

Thomas; Klucken, Jochen; Schmitz, Gerd (1) 

CORPORATE SOURCE: (1) Institut fuer Klinische Chemie und 

Laboratoriumsmedizin, Klinikum der Universitaet Regensburg, 
Franz- Josef -Strauss-Allee 11, D-93042, Regensburg Germany 

SOURCE: Journal of Leukocyte Biology, (Jan., 2000) Vol. 67, No. 1, 

pp. 97-103. 
ISSN: 0741-5400. 

DOCUMENT TYPE: Article 

LANGUAGE : English 

SUMMARY LANGUAGE: English 

L2 ANSWER 7 OF 19 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. on STN 
ACCESSION NUMBER: 1999:386551 BIOSIS 
DOCUMENT NUMBER: PREV1999003 8655 1 

TITLE: The gene encoding ATP-binding cassette transporter 1 is 

mutated in Tangier disease. 
AUTHOR(S): Bodzioch, Marek; Orso, Evelyn; Klucken, Jochen; 

Langmann, Thomas; Boettcher, Alfred; Diederich, Wendy; 

Drobnik, Wolfgang; Barlage, Stefan; Buechler, Christa; 

Porsch-Oezcueruemez, Mustafa; Kaminski, Wolfgang E.; 
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DOCUMENT TYPE: 
LANGUAGE : 
SUMMARY LANGUAGE: 
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Hahmann, Harry W.; Oette, Kurt; Rothe, Gregor; Aslanidis, 
Charalampos; Lackner, Karl J.; Schmitz, Gerd (1) 
(1) Institute for Clinical Chemistry and Laboratory 
Medicine, University of Regensburg, D-93042, Regensburg 
Germany 

Nature Genetics, (Aug., 1999) Vol. 22, No. 4, pp. 347-351. 

ISSN: 1061-4036. 

Article 

English 

English 

19 BIOSIS COPYRIGHT 2 003 BIOLOGICAL ABSTRACTS INC. on STN 
1999:247874 BIOSIS 
PREV199900247874 

Molecular cloning of the human ATP-binding cassette 
transporter 1 (hABCl) : Evidence for sterol -dependent 
regulation in macrophages . 

Langmann, Thomas; Klucken, Jochen; Reil, Markus; 
Liebisch, Gerhard; Luciani, Marie-Francoise ; Chimini, 
Giovanna; Kaminski, Wolfgang E.; Schmitz, Gerd (1) 
(1) Institute for Clinical Chemistry and Laboratory 
Medicine, University of Regensburg, Franz- Josef -Strauss- 
Allee 11, D-93 042, Regensburg Germany 

Biochemical and Biophysical Research Communications, (April 
2, 1999) Vol. 257, No. 1, pp. 29-33. 
ISSN: 0006-291X. 
Article 
English 
English 

19 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. on STN 
1997:522758 BIOSIS 
PREV199799821961 

Scavenging, signalling and adhesion coupling in 
macrophages: Implications for atherogenesis . 
Schmitz, Gerd (1); Orso, Evelyn; Rothe, Gregor; 
Klucken, Jochen 

(1) Inst. Clinical Chem. Lab. Med., Univ. Regensburg, 
Franz-Josef -Strauss -Allee 11, D-93053 Regensburg Germany 
Current Opinion in Lipidology, (1997) Vol. 8, No. 5, pp. 
287-300 . 

ISSN: 0957-9672. 
General Review 
English 

19 BIOSIS COPYRIGHT 2003 BIOLOGICAL ABSTRACTS INC. on STN 
1997:61424 BIOSIS 
PREV199799360627 

Peripheral blood mononuclear phagocyte subpopulations as 
cellular markers in hypercholesterolemia. 
Rothe, Gregor; Gabriel, Holger; Kovacs, Eva; Klucken, 
Jochen; Stoehr, Josef; Kindermann, Wilfried; Schmitz, 
Gerd 

Inst. Clincial Chemistry Lab. Med., Univ. Regensburg, 
Franz- Josef -Strausse-Allee 11, D-93053 Regensburg Germany 
Arteriosclerosis Thrombosis and Vascular Biology, (1996) 
Vol. 16, No. 12, pp. 1437-1447. 
ISSN: 1079-5642. 
Article 
English 
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Genomic organization and characterization of the 
promoter of the human ATP-binding cassette 
transporter-Gl (ABCG1) gene 

Langmann, Thomas; Porsch-Ozcurumez , Mustafa; 
Unkelbach, Uwe; Klucken, Jochen; Schmitz, 
Gerd 

Institute for Clinical Chemistry and Laboratory 
Medicine, University of Regensburg, Regensburg, 93042, 
Germany 

Biochimica et Biophysica Acta (2000), 1494(1-2), 
175-180 

CODEN: BBACAQ; ISSN: 0006-3002 
Elsevier Science B.V. 
Journal 
English 

2 6 THERE ARE 2 6 CITED REFERENCES AVAILABLE FOR THIS 
RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 
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CAPLUS COPYRIGHT 2 003 ACS on STN 
2000 : 426543 CAPLUS 
133 :220696 

Identification of a Novel Human Sterol-Sensitive 

ATP-Binding Cassette Transporter (ABCA7) 

Kaminski, Wolfgang E.; Orso, Evelyn; Diederich, Wendy; 

Klucken, Jochen; Drobnik, Wolfgang; Schmitz, 

Gerd 

Institute for Clinical Chemistry and Laboratory 
Medicine, University of Regensburg, Regensburg, 
D-93042, Germany 

Biochemical and Biophysical Research Communications 
(2000) , 273 (2) , 532-538 
CODEN: BBRCA9; ISSN: 0006-291X 
Academic Press 
Journal 
English 
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2000 : 227775 CAPLUS 
132 :275181 

ATP-binding cassette genes and proteins for diagnosis 
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